Abstract

We introduce a model of computation based on quaternions, which is inspired by the
quantum computing model. Pure states are vectors of a suitable linear space over the
quaternions. Other aspects of the theory are the same as in quantum computing:
superposition and linearity of the state space, unitarity of the transformations, and projective
measurements. One notable exception, however, is that quaternionic circuits do
not have a uniquely defined behaviour unless a total ordering of evaluation of the gates
is defined. Given such an ordering, a unique unitary operator can be associated with the
quaternionic circuit, and a proper semantics of computation can be associated with it.

The main result of this paper consists in showing that this model is no more powerful
than quantum computing, as long as such an ordering of gates can be defined. More
concretely, we show that for every quaternionic computation using n quaterbits, the behaviour
of the circuit for each possible gate ordering can be simulated with n + 1 qubits, and
this with little or no overhead in circuit size. The proof of this result is inspired by a
new, simplified and improved proof of the equivalence to quantum computing of a similar
model based on real amplitudes, which states that any quantum computation using n
qubits can be simulated with n + 1 rebits, and this with no circuit size overhead.

Beyond this potential computational equivalence, however, we propose this model as a 
simpler framework in which to discuss the possibility of a quaternionic quantum mechanics 
or information theory. In particular, it already allows us to illustrate that the introduction 
of quaternions might violate some of the "natural" properties that we have come to expect 
from physical models. 



1 Introduction 

Quantum Computing represents yet another disconcerting puzzle to Complexity Theory. What 
we know today is that quantum computing devices can efficiently solve certain problems, which, 
in appearance, classical or probabilistic computers cannot solve efficiently. Even though we 
would like to believe that quantum computing violates the strong Church-Turing thesis, the
sore truth is that the known results do not provide us with a proof, only constituting, at best,
"strong evidence" thereof.

Yet, even though we cannot provide a strict separation between these models, we do know 
certain inclusions between variations of these computing models. Perhaps the most natural 
variation from standard Quantum Computing is that in which we change the domain of the
state vector amplitudes, and hence the domain of their allowed linear transformations.

It was first shown that restricting ourselves to real amplitudes does not diminish the power of 
quantum computing [7], and further, that in fact rational amplitudes are sufficient [1]. Both 
these results were proven in the Quantum Turing Machine model, and the respective proofs are 
quite technical. Direct proofs of the first result for the quantum circuit model stem from the 
fact that several sets of gates universal for quantum computing have been found [14, 8, 19, 18], 
which involve only real coefficients. 

In this paper, we introduce another possible variation on quantum computing involving
quaternionic amplitudes, and prove an equivalence result that shows that no further computational
power should reasonably be expected in this model. In Section 2, we will start by redefining quantum
computing in an axiomatic fashion, which will make it possible to easily generalise the model
to other, non-complex Hilbert spaces. We will redefine and review the results known for
computing on real Hilbert spaces in Section 3, also providing a new generic and structural proof
of the equivalence of this model to standard complex quantum computing. We will introduce
the quaternionic computing model in Section 4, discuss some of its peculiarities, and then show
how the above proof can be easily adapted to the quaternionic case. In Section 5, we discuss
some consequences of this result in terms of computational complexity, and also some of the
particularities of the quaternionic model and its possible "physical" interpretations. Finally, we
summarise our conclusions and propose further open questions in Section 6.

2 Quantum Computing Revisited 

The basic tenets of Quantum Computing are as follows:

States. The pure states describing the internal configuration of an n-qubit computing device
are defined as 1-dimensional rays in a $2^n$-dimensional vector space over the complex
numbers. Over such a vector space, the usual inner product defines the standard L2-norm,
which in turn defines a proper Hilbert space^1. With respect to this norm, states are
normally represented as unit vectors, up to an arbitrary phase factor $e^{i\theta}$, with $0 \le \theta < 2\pi$.

Measurement. The canonical basis of this vector space is given special meaning, and called
the computational basis, in that it represents states which always give the same outcome
when "queried" about their information content. The states are usually labelled by n-bit
strings $b = b_1 \ldots b_n$. For a generic pure state $|\Phi\rangle$, the probabilities of measurement
outcomes are given by the following rule

$$\Pr(|\Phi\rangle \to \text{``}b\text{''}) = |\langle \Phi | b \rangle|^2 \tag{1}$$

where $|b\rangle$ is some computational basis vector.

^1 This is only true because we are considering finite dimensional inner-product spaces, which are trivially
complete.
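The measurement rule of Equation 1 can be illustrated with a minimal sketch. The function name `measurement_probabilities` and the convention that the amplitude at list index j belongs to the basis vector labelled by j written as an n-bit string are assumptions made for this illustration only.

```python
import math

def measurement_probabilities(state):
    """Return {bit string: probability} for a list of complex amplitudes.

    Assumes (for illustration) that index j of the list is the amplitude of
    the computational basis vector whose label is j written with n bits.
    """
    n = len(state).bit_length() - 1
    if n < 1 or 2 ** n != len(state):
        raise ValueError("state length must be a power of two, at least 2")
    norm_sq = sum(abs(a) ** 2 for a in state)
    if not math.isclose(norm_sq, 1.0, abs_tol=1e-9):
        raise ValueError("state must be a unit vector")
    # Equation 1: Pr(b) = |<Phi|b>|^2, i.e. the squared modulus of the amplitude.
    return {format(j, f"0{n}b"): abs(a) ** 2 for j, a in enumerate(state)}

# Example: the 2-qubit state (|00> + i|11>)/sqrt(2).
s = 1 / math.sqrt(2)
probs = measurement_probabilities([s, 0, 0, 1j * s])

# Multiplying the state by a global phase leaves every probability unchanged.
phase = complex(math.cos(0.7), math.sin(0.7))
probs_phased = measurement_probabilities([phase * a for a in [s, 0, 0, 1j * s]])
```

The second call reflects the fact that states are rays: two unit vectors differing by a phase factor yield the same outcome probabilities.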
Transformations. Generally speaking, the transformations that are allowed are linear
mappings. In addition, in order for the quantities above to be proper probabilities, these
transformations must preserve L2-norm. The only relevant linear and L2-norm preserving
operations are unitary transformations^2. These are usually represented in the matrix
form in which the column vectors are the images of the canonical basis under the given
transformation, listed in lexicographical order.

Circuits. The computational device is modelled as a circuit, which, without loss of generality, 
can be assumed to have the following characteristics 

• The input to the circuit is any pure state. 

• The circuit is an array of elementary universal gates. 

For example, we can choose the 2-qubit CNOT gate and arbitrary 1-qubit rotations as
a universal set. Furthermore, we allow gates to operate on any two arbitrary wires, not
necessarily contiguous^3.

Algorithms. A quantum algorithm can be formally described as a classical Turing Machine,
which given a classical string x will generate a (classical) description of a quantum circuit.
The quantum computer can then produce an answer based on the result of measurements
of the output wires of the quantum circuit. Without loss of generality, we can assume
that the circuit is to be evaluated with the ground state (the all-zero computational basis
vector) as its initial state. The algorithm is said to be efficient if the corresponding TM
runs in time polynomial in the size of the input x, which in turn implies that the circuit
size is also polynomial.

From a purely abstract point of view, it can be inferred that the only requirements of this
model are that the state space have a linear structure and a proper norm-inducing inner product,
so that the measurement rule is always sound. Traditionally, quantum computing has been
described in terms of complex Hilbert spaces, but in principle, as we just discussed, a sound
model of computation can be defined on any other Hilbert space. In particular, in this paper
we study models of real computing and quaternionic computing, based on the $2^n$-dimensional
vector spaces over the reals and the quaternions, respectively.

3 Real Computing 
3.1 Definitions 

Intuitively, the real computing model is defined as a restricted version of quantum computing,
where all amplitudes in the state vectors are required to be real numbers. Conjugation is
equivalent to the identity operation and bras are simply transposed kets. Similarly, the matrix
dagger operator ($\dagger$) can be replaced with the matrix transpose operator ($t$).

^2 Anti-unitary transformations also preserve L2-norm, but do not preserve inner product and are not usually
considered as legal quantum transformations.

^3 This is not the usual model, in which gates are restricted to act on contiguous wires. However, this model
is not more powerful than the latter, since it can be simulated efficiently with at most a quadratic number of
swap gates.

In this case, we must replace unitary transformations with orthonormal transformations, as 
these are the only inner-product preserving operations on this inner-product space. One could 
conceive a model in which the state vectors always have real amplitudes, but in which arbitrary 
unitary transformations (on the complex Hilbert space) are allowed, as long as the end result 
is still a real amplitude vector. It is elementary to show that orthonormal transformations are 
the only ones that have this property, and hence this model is as general as can be, given the 
fact that we insist that the amplitudes be real. 

Rebits and States 

In quantum computing and quantum information theory, we define the qubit as the most 
elementary information-containing system. Abstractly, the state of a qubit can be described 
by a 2-dimensional state vector 

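$$|\Phi\rangle = \alpha|0\rangle + \beta|1\rangle, \quad \text{s.t. } \|\Phi\|_2 = \sqrt{|\alpha|^2 + |\beta|^2} = 1 \tag{2}$$

where $|0\rangle$ and $|1\rangle$ are the two canonical basis vectors for such a 2-dimensional space. Two vectors
$|\Phi\rangle$ and $|\Phi'\rangle$ are said to represent the same qubit value if they are in the same 1-dimensional
ray. In other words,

$$\Phi \equiv \Phi' \iff |\Phi\rangle = e^{i\theta}|\Phi'\rangle, \quad \text{where } \theta \in [0, 2\pi). \tag{3}$$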

Definition 1 (Rebit). The corresponding concept in real computing is called a rebit. As in
Equation 2, its state can also be described by a 2-dimensional vector on the real Hilbert space

$$|\Phi\rangle = a|0\rangle + b|1\rangle, \quad \text{s.t. } \|\Phi\|_2 = \sqrt{a^2 + b^2} = 1 \tag{4}$$

In this case, the arbitrary phase factor can only be +1 or −1, and the rebit equivalence relation
which replaces Equation 3 is

$$\Phi \equiv \Phi' \iff |\Phi\rangle = e^{i\theta}|\Phi'\rangle, \quad \text{where } \theta \in \{0, \pi\} \tag{5}$$
$$\phantom{\Phi \equiv \Phi'} \iff |\Phi\rangle = \pm|\Phi'\rangle \tag{6}$$

As for qubits, single rebit states do have a nice geometrical interpretation: they are
isomorphic to the circle, having $|0\rangle$ and $|1\rangle$ at opposite extremes. One way to see this
is to consider the locus of points in the Bloch sphere for which $e^{i\phi} = 1$, or in other words, those
with no circular polarisation. Unfortunately, there is no such nice geometric representation of
an arbitrary n-qubit state, and we believe the same is true for n-rebit states.

The computational basis vectors for a rebit are still $|0\rangle$ and $|1\rangle$, and for arbitrary n-rebit systems
they can also be represented as n-bit strings. The measurement rule defining the probabilities
of obtaining the corresponding bit string as a result is essentially the same as Equation 1,

$$\Pr(|\Phi\rangle \to \text{``}b\text{''}) = |\langle \Phi | b \rangle|^2 = \langle \Phi | b \rangle^2 \tag{7}$$

where in this case we can drop the modulus operator $|\cdot|$, because it is redundant.
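As a small illustration of Equations 4 to 7, the sketch below parametrises a single rebit by an angle on the circle and checks that the two representatives $\pm|\Phi\rangle$ of the same rebit value give identical outcome probabilities. The helper names `rebit` and `rebit_probabilities` are hypothetical and introduced only for this example.

```python
import math

def rebit(angle):
    """Rebit state cos(angle)|0> + sin(angle)|1> as a pair of real amplitudes."""
    return (math.cos(angle), math.sin(angle))

def rebit_probabilities(state):
    """Equation 7: with real amplitudes, the plain square replaces |.|^2."""
    a, b = state
    return {"0": a * a, "1": b * b}

# A rebit at angle pi/6, and the same rebit with the opposite sign (angle + pi).
p = rebit_probabilities(rebit(math.pi / 6))
q = rebit_probabilities(rebit(math.pi / 6 + math.pi))
```

Here `p` and `q` agree, which is the content of the equivalence relation in Equation 6.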

One physical interpretation that can be given for rebits or rebit systems is that of a system of 
photons, where we use the polarisation in the usual manner to carry the information. However, 
these photons are restricted to having zero circular polarisation, and being operated upon 
with propagators which never introduce circular polarisation, i.e. orthonormal operators. The 
computational basis measurements are still simple polarisation measurements in the
vertical-horizontal basis.

Real Circuits and Real Computational Complexity 

We can also define and construct real circuits, as a restriction of quantum circuits. Topologically, 
they are the same, as we will still require them to be constructed only with reversible gates. 
Since orthonormal matrices, like unitary matrices, are preserved under the tensor algebra that 
describes circuit constructions (see [5, 6] for more details on this formalism), it is sufficient to 
require that the elementary gates be orthonormal. With this, we are assured that the overall 
circuit transformation will be norm-preserving. We can then define a measurement rule for 
circuit states, which will yield classical results with probabilities exactly as in Equation 7. As 
was noted before, this rule is completely general and does not depend on the field on which the 
inner-product space of states is defined. 

Real Algorithms 

To complete the definition of this computational model, we must define what it means for such 
real computing devices to "compute" or to "solve a problem." For that, we simply restrict the 
definition of a quantum algorithm given above. 

Definition 2 (Real Algorithm). A real algorithm is defined as a classical TM, which on
(classical) input x will generate a (classical) description of a rebit circuit. The result of the
measurement of the final state $|\Phi\rangle$ of the rebit circuit is post-processed by the TM to produce its
final (classical) answer.

The TM can be viewed as having access to a universal circuit evaluator or oracle, which will 
produce a classical answer b, with the probabilities defined in Equation 7. It is important 
to note that no matter what classical post-processing the classical Turing Machine does after 
obtaining an answer from the Oracle, its final answer ultimately only depends on the outcome 
probabilities. In other words, from the TM's point of view, it does not matter if the circuit is 
physically constructed or just simulated by the Oracle, nor does it matter what technology was 
used or what mathematical abstraction was employed in its simulation. What matters is that
the outcome probabilities of the Oracle be the same as those of the circuit description provided by
the TM.

3.2 Previously Known Results 

From a Complexity Theory point of view, the first question that arises naturally is how does 
this real computing model compare with the quantum computing one. In other words, can the 
problems which are efficiently solved by a quantum algorithm also be solved by an efficient real
algorithm? 

For the Quantum Turing Machine model, the answer was previously known to be "Yes". Even
though it is not explicitly stated as such, the following theorem is traditionally attributed to
Bernstein and Vazirani, as it can be easily deduced from the results in [7].

Theorem 1 (Bernstein, Vazirani). Any Quantum Turing Machine can be approximated
sufficiently well by another, whose transition matrix only contains computable real numbers of
the form $\pm\cos(kR)$ and $\sin(kR)$, where $k$ is an integer and

$$R = 2\pi \sum_{i=1}^{\infty} 2^{-2^i}.$$

The need for having such transcendental amplitudes was eventually removed. By using
transcendental number theory techniques, Adleman, Demarrais, and Huang showed in [1] that, in
fact, only a few rational amplitudes were required, in particular only the set {0, ±1, ±3/5, ±4/5}.

It is important to note that Theorem 1 does not apply directly to circuits, or at least not 
in a completely trivial manner. The constructions in the proof are relatively elaborate and 
rely heavily on techniques of Turing Machine engineering. Nonetheless, quantum circuits were 
shown to be equivalent to Quantum Turing Machines by A.C-C Yao in [21]. In principle, the 
construction of that proof could be used to show that quantum circuits do not require states 
with complex amplitudes to achieve the same power as any complex-valued circuit or QTM. 

However, the celebrated universality result of Barenco, Bennett, Cleve, DiVincenzo, Margolus,
Shor, Sleator, Smolin, and Weinfurter [1] provides a first step towards a proof of that fact,
as they show that CNOT and arbitrary 1-qubit gates form a universal set of gates for quantum 
circuits. While arbitrary 1-qubit gates can contain complex amplitude transitions, more recent 
results have produced ever smaller sets of universal gates, which are comprised only of real 
amplitude transitions. The following is just a sample list of such results: 

• TOFFOLI, HADAMARD, and $\pi/4$-rotation, by Kitaev [11] in 1997.

• CNOT, HADAMARD, $\pi/8$-rotation, by Boykin, Mor, Pulver, Roychowdhury, and Vatan [8] in
2000.

• TOFFOLI and HADAMARD, by Shi [19] in 2002, with a simpler proof by Aharonov [3] in 2003.

• Controlled ^-rotations, by Rudolph and Grover [18] in 2002.

The motivation behind these results was to come up with the simplest possible gates, given the 
fact that quantum states in nature can and will have arbitrary complex amplitudes, and thus, 
so will their unitary propagators. The fact that the simpler sets involve only real numbers was 
a priori just a "desirable side-effect." Our motivation, however, is completely different. We 
play a different game: suppose that all we had were these mysterious "rebits," unable to enter 
complex amplitudes. What could we do then? Because of this motivation, our proof will have 
a different flavour. In fact, the proof is completely general in that it works with any universal 
set of gates. In particular, it will work with gates which have arbitrary complex transition
amplitudes. In other words, in proving the following, more general theorem, we will completely 
ignore the above results. That will allow us to recycle its proof later on in Section 4. 

Theorem 2. Any n-qubit quantum circuit constructed with gates of degree d or less (possibly
including non-standard gates with complex coefficients) can be exactly simulated with an (n + 1)-rebit
circuit with the same number of gates of degree at most d + 1.



3.3 A New Proof of Equivalence 
3.3.1 The Underlying Group Theory 

The idea behind the proof is to make use of the fact that the group $SU(N)$ can be embedded
into the group $SO(2N)$. We provide an explicit embedding $h$.^4 While this mapping is not
unique, what is special about it is that it has all the necessary properties for us to define a
sound simulation algorithm based on it. This mapping is defined as follows. Given an arbitrary
unitary transformation $U$, its image $O = h(U)$ is

$$U \xrightarrow{\;h\;} O = h(U) \triangleq \begin{pmatrix} \mathrm{Re}(U) & \mathrm{Im}(U) \\ -\mathrm{Im}(U) & \mathrm{Re}(U) \end{pmatrix} \tag{8}$$

where the Re and Im operators return the real and imaginary parts of a complex number,
respectively, and applied to complex matrices, return the matrix composed of the real and
imaginary parts of each entry. Note also that if we define the following formal tensor

$$T \triangleq \begin{pmatrix} \mathrm{Re} & \mathrm{Im} \\ -\mathrm{Im} & \mathrm{Re} \end{pmatrix} \tag{9}$$

we can express the definition of $h$ more simply as

$$U \xrightarrow{\;h\;} O = h(U) = T \otimes U \tag{10}$$
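The mapping of Equation 8 is straightforward to write down explicitly. The following is a minimal sketch in plain Python, with matrices represented as lists of rows; this representation and the example gate are assumptions made for illustration.

```python
def h(U):
    """Map an N x N complex matrix to the 2N x 2N real matrix of Equation 8."""
    re = [[complex(z).real for z in row] for row in U]
    im = [[complex(z).imag for z in row] for row in U]
    top = [r + i for r, i in zip(re, im)]                   # [  Re(U) | Im(U) ]
    bottom = [[-x for x in i] + r for r, i in zip(re, im)]  # [ -Im(U) | Re(U) ]
    return top + bottom

# Example: the 1-qubit phase gate diag(1, i), which has a complex entry.
S = [[1, 0], [0, 1j]]
O = h(S)
# O is the real 4 x 4 matrix
#   [[1,  0, 0, 0],
#    [0,  0, 0, 1],
#    [0,  0, 1, 0],
#    [0, -1, 0, 0]]
```

The image of a $d$-qubit gate is thus a real matrix of twice the dimension, i.e. a gate on $d + 1$ rebits.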



The first fundamental property that this mapping must have for us to use it effectively in a 
simulation is the following. 

Theorem 3. Let $G_N$ represent the image of $SU(N)$ under $h$. Then $h$ is a proper group
isomorphism between $SU(N)$ and $G_N$, and $G_N$ is a subgroup of $SO(2N)$.

Proof. It is easy to see that any matrix in $G_N$, which will have the form of Equation 8, will
have a unique inverse image, and hence that $h$ is an injective mapping. The following lemma
is sufficient to show that $h$ is a group homomorphism.

Lemma 1. Let $A$ and $B$ be any two arbitrary $N \times N$ matrices, then $h(AB) = h(A)h(B)$.

^4 Independently, Aharonov [3] has also used this mapping recently to provide a simple proof that TOFFOLI
and HADAMARD are universal.
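Proof. The first step is to obtain a simple matrix multiplication rule for matrices, using the
operators Re and Im. For arbitrary complex numbers $\alpha$ and $\beta$, we have that

$$\begin{aligned}
\mathrm{Re}(\alpha\beta) &= \mathrm{Re}(\alpha)\,\mathrm{Re}(\beta) - \mathrm{Im}(\alpha)\,\mathrm{Im}(\beta) \\
\mathrm{Im}(\alpha\beta) &= \mathrm{Re}(\alpha)\,\mathrm{Im}(\beta) + \mathrm{Im}(\alpha)\,\mathrm{Re}(\beta)
\end{aligned} \tag{11}$$

Since these rules hold for the products of all of their entries, it is then easy to see that this same
multiplication rule will also hold for complex matrices. In other words, we can substitute $\alpha$ and
$\beta$ in Equation 11 with any two arbitrary complex matrices $A$ and $B$ which are multipliable, to
get

$$\begin{aligned}
\mathrm{Re}(AB) &= \mathrm{Re}(A)\,\mathrm{Re}(B) - \mathrm{Im}(A)\,\mathrm{Im}(B) \\
\mathrm{Im}(AB) &= \mathrm{Re}(A)\,\mathrm{Im}(B) + \mathrm{Im}(A)\,\mathrm{Re}(B)
\end{aligned} \tag{12}$$

We are now equipped to verify our claim

$$\begin{aligned}
h(A)h(B) &= (T \otimes A)(T \otimes B) \\
&= \begin{pmatrix} \mathrm{Re}(A) & \mathrm{Im}(A) \\ -\mathrm{Im}(A) & \mathrm{Re}(A) \end{pmatrix}
   \begin{pmatrix} \mathrm{Re}(B) & \mathrm{Im}(B) \\ -\mathrm{Im}(B) & \mathrm{Re}(B) \end{pmatrix} \\
&= \begin{pmatrix}
   \mathrm{Re}(A)\mathrm{Re}(B) - \mathrm{Im}(A)\mathrm{Im}(B) & \mathrm{Re}(A)\mathrm{Im}(B) + \mathrm{Im}(A)\mathrm{Re}(B) \\
   -\mathrm{Im}(A)\mathrm{Re}(B) - \mathrm{Re}(A)\mathrm{Im}(B) & -\mathrm{Im}(A)\mathrm{Im}(B) + \mathrm{Re}(A)\mathrm{Re}(B)
   \end{pmatrix} \\
&= \begin{pmatrix} \mathrm{Re}(AB) & \mathrm{Im}(AB) \\ -\mathrm{Im}(AB) & \mathrm{Re}(AB) \end{pmatrix} \\
&= T \otimes AB = h(AB)
\end{aligned} \tag{13}$$
□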
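Lemma 1 can also be checked numerically on arbitrary matrices. The sketch below is only a sanity check under the list-of-rows representation assumed here, not a substitute for the proof; the two test matrices are arbitrary choices.

```python
def matmul(A, B):
    """Plain matrix product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def h(U):
    """Block matrix [[Re U, Im U], [-Im U, Re U]] of Equation 8."""
    re = [[complex(z).real for z in row] for row in U]
    im = [[complex(z).imag for z in row] for row in U]
    return ([r + i for r, i in zip(re, im)] +
            [[-x for x in i] + r for r, i in zip(re, im)])

# Two arbitrary complex 2 x 2 matrices (Lemma 1 does not require unitarity).
A = [[1 + 2j, 3j], [2, 1 - 1j]]
B = [[0.5j, 1], [1 - 1j, 2 + 1j]]

lhs = h(matmul(A, B))       # h(AB)
rhs = matmul(h(A), h(B))    # h(A) h(B)
max_diff = max(abs(x - y) for rl, rr in zip(lhs, rhs) for x, y in zip(rl, rr))
```

`max_diff` is zero up to floating-point rounding, as Equation 13 predicts.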



Finally, we want to show that $G_N \subset SO(2N)$. This is equivalent to showing that all the images
$O = h(U)$ are orthonormal, i.e. that $O^t = O^{-1}$. Since by Lemma 1 $h$ is a group homomorphism,
it maps inverse elements into inverse elements, i.e. $h(U^{-1}) = h(U)^{-1}$. Since $U$ is unitary, we
have that

$$O^{-1} = h(U)^{-1} = h(U^{-1}) = h(U^\dagger) \tag{14}$$

while the following lemma will give us an expression for $O^t$.

Lemma 2. Let $A$ be an arbitrary $N \times N$ complex matrix, then $h(A^\dagger) = h(A)^t$.

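Proof. By definition of $h$ and by transposition rules of block matrices, we have

$$\begin{aligned}
h(A)^t &= (T \otimes A)^t \\
&= \begin{pmatrix} \mathrm{Re}(A) & \mathrm{Im}(A) \\ -\mathrm{Im}(A) & \mathrm{Re}(A) \end{pmatrix}^t \\
&= \begin{pmatrix} \mathrm{Re}(A)^t & -\mathrm{Im}(A)^t \\ \mathrm{Im}(A)^t & \mathrm{Re}(A)^t \end{pmatrix} \\
&= \begin{pmatrix} \mathrm{Re}(A^\dagger) & \mathrm{Im}(A^\dagger) \\ -\mathrm{Im}(A^\dagger) & \mathrm{Re}(A^\dagger) \end{pmatrix} \\
&= T \otimes A^\dagger = h(A^\dagger)
\end{aligned} \tag{15}$$

where we also used the following generic matrix identities

$$\begin{aligned}
\mathrm{Re}(A^\dagger) &= \mathrm{Re}(A)^t \\
\mathrm{Im}(A^\dagger) &= -\mathrm{Im}(A)^t.
\end{aligned} \tag{16}$$

□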

In particular, we have that $O^t = h(U)^t = h(U^\dagger) = h(U^{-1}) = O^{-1}$, and we are done proving
Theorem 3. □
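The two facts just used, Lemma 2 and the orthonormality of $h(U)$, can be checked on a concrete unitary. The gate $U = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 & i \\ i & 1 \end{pmatrix}$ below is an arbitrary example chosen for this sketch, and the helper names are hypothetical.

```python
import math

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(M):
    return [list(col) for col in zip(*M)]

def dagger(M):
    """Conjugate transpose."""
    return [[complex(z).conjugate() for z in row] for row in transpose(M)]

def h(U):
    re = [[complex(z).real for z in row] for row in U]
    im = [[complex(z).imag for z in row] for row in U]
    return ([r + i for r, i in zip(re, im)] +
            [[-x for x in i] + r for r, i in zip(re, im)])

s = 1 / math.sqrt(2)
U = [[s, 1j * s], [1j * s, s]]   # unitary, with complex entries
O = h(U)

# Lemma 2: h(U^dagger) should equal h(U)^t.
lemma2_gap = max(abs(x - y)
                 for ra, rb in zip(h(dagger(U)), transpose(O))
                 for x, y in zip(ra, rb))

# Orthonormality: O^t O should be the 4 x 4 identity.
gram = matmul(transpose(O), O)
identity_gap = max(abs(gram[i][j] - (1 if i == j else 0))
                   for i in range(4) for j in range(4))
```

Both gaps vanish up to rounding for this example.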



The fact that $h$ is a group isomorphism is important, because it implies that the correspondence
given by $h$ is preserved under "serial" circuit construction. In other words, it means that if we
have real circuits that simulate the quantum circuits with operators $U$ and $V$, then we can
simulate a quantum circuit with operator $UV$ by simply putting both real circuits together. This
suggests a way in which to decompose the problem of simulating a generic quantum circuit, i.e. by
constructing the real circuit one level at a time.



3.3.2 The Simulation Algorithm 

Let $C$ be a generic n-qubit quantum circuit with operator $U_C$, composed of $s$ elementary gates.
The simulation algorithm will consist of the following steps:

Step 1. Serialise the given circuit by finding an ordering of its gates, so that they can be
evaluated in that order, one by one. In other words, find a total order of the circuit
gates, such that $U_C = U^{(s)} U^{(s-1)} \cdots U^{(2)} U^{(1)}$.

Step 2. For each gate $g \in \{1, \ldots, s\}$ in the above ordering, replace the n-ary operation $U^{(g)}$,
corresponding to the $g$-th gate, with an adequate real circuit $O^{(g)}$ simulating it.

Step 3. Construct the overall real circuit $C'$ by concatenating the circuits for each level $g$, in
the same order as defined in Step 1. That is, if $O_C$ is the operator for $C'$, then let
$O_C = O^{(s)} O^{(s-1)} \cdots O^{(2)} O^{(1)}$.

Step 4. Write a description of the real circuit $C'$ and of its input state and ask the real
computing "oracle" to provide the result of a measurement on its final state.

Step 5. Perform the classical post-processing on the result of the measurement and provide a
classical answer.

The algorithm, as described so far, is not completely defined. In what follows, we will derive, 
one by one, the missing details. 

First, the total order in Step 1 can be obtained by doing a topological sort of the circuit's
directed graph. This can be done efficiently in time polynomial in the size of the circuit^5. The
effects of Step 1 on $C$ are depicted in Figure 1.

^5 These orderings, because of the fact that they can be found efficiently, are the base of the "strong"
equivalence of the circuit and the Turing Machine models of computation.

Figure 1: Serialisation of the quantum circuit C in Step 1.
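The serialisation of Step 1 can be sketched with a standard topological sort (Kahn's algorithm). The circuit encoding used here, one left-to-right list of gate identifiers per wire, is an assumption made for the illustration; the text does not prescribe a particular input format.

```python
from collections import deque

def serialise(wires):
    """Return a total order of the gates compatible with every wire.

    `wires` holds, for each wire, the left-to-right list of gate ids acting
    on it; a multi-wire gate appears in the list of each wire it touches.
    """
    successors, indegree = {}, {}
    for gates in wires:
        for g in gates:
            successors.setdefault(g, set())
            indegree.setdefault(g, 0)
    # Consecutive gates on the same wire give an edge of the circuit graph.
    for gates in wires:
        for a, b in zip(gates, gates[1:]):
            if b not in successors[a]:
                successors[a].add(b)
                indegree[b] += 1
    ready = deque(sorted(g for g, d in indegree.items() if d == 0))
    order = []
    while ready:
        g = ready.popleft()
        order.append(g)
        for nxt in sorted(successors[g]):
            indegree[nxt] -= 1
            if indegree[nxt] == 0:
                ready.append(nxt)
    if len(order) != len(indegree):
        raise ValueError("gate dependencies contain a cycle")
    return order

# Gate "a" acts on wires 0 and 1, "b" on wire 2, "c" on wires 1 and 2, "d" on wire 0.
circuit = [["a", "d"], ["a", "c"], ["b", "c"]]
order = serialise(circuit)
```

Any order returned this way evaluates each gate only after all gates that precede it on one of its wires. In general several valid orders exist; for complex quantum circuits they all yield the same operator $U_C$.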



3.3.3 Constructing the Real Circuit 

In principle, each of the elementary quantum gates $g$ is described by a unitary operator defined
on the $d$-qubit complex Hilbert space. We can assume without loss of generality that these
gates are described in the input to the simulation algorithm as $2^d \times 2^d$ matrices^6, which we
denote with subscripted capitals. Thus, the $g$-th gate has associated to it a $d$-ary gate operator
$U_g$ (with typically $d = 1, 2$).

However, in the context of a circuit the operator fully describing the action of gate $g$ is an
$N$-ary operator acting on all $n$ qubits, which depends not only on $U_g$ but also on the positions
of the wires on which $g$ acts. We denote these operators with superscripted capitals. Thus, after
serialisation of the circuit $C$ in Step 1, these operators $U^{(g)}$ will correspond to the $g$-th level of
the serialised version of $C$.

In general, the $g$-th gate will be a $d$-ary gate operating on wires with indices $j_1 < j_2 < \cdots < j_d$,
not necessarily contiguous, with the associated circuit operator $U^{(g)}$, which will depend on
$j_1, \ldots, j_d$. For example, in the case of a 2-qubit gate $g$ operating on the $j$-th and $k$-th wires,
$1 \le j < k \le n$, $U^{(g)}$ can be expressed in terms of its elementary gate $U_g$ as follows

$$\begin{aligned}
U^{(g)}(j,k) &= S_{1,j}\, S_{2,k}\, (U_g \otimes I_{n-2})\, S_{2,k}\, S_{1,j} \\
&\triangleq S_g\, (U_g \otimes I_{n-2})\, S_g
\end{aligned} \tag{17}$$

where $I_m$ is the identity operator for $m$ qubits, $S_{i,j}$ is the n-qubit swap operator acting on wires
$i$ and $j$, and $S_g = S_{j,k}$ is a shorthand for describing the necessary swap operator for the $g$-th
gate. The logic behind Equation 17 is explained graphically in Figure 2. Note, however, that
this conversion using swap gates is not itself part of the simulation, but only a mathematical
convenience to be used later. These swap gates will not be included in the final real circuit $C'$
and do not represent a computational overhead.
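To make Equation 17 concrete, the sketch below builds $U^{(g)}(j,k)$ for a small circuit out of explicit swap matrices. It assumes, for illustration only, that wire 1 corresponds to the most significant bit of the basis-vector index; the helper names are hypothetical.

```python
def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def kron(A, B):
    """Tensor (Kronecker) product of two matrices given as lists of rows."""
    return [[a * b for a in ra for b in rb] for ra in A for rb in B]

def identity(dim):
    return [[1 if i == j else 0 for j in range(dim)] for i in range(dim)]

def swap_op(n, i, j):
    """n-wire permutation matrix exchanging wires i and j (1-indexed)."""
    dim = 2 ** n
    M = [[0] * dim for _ in range(dim)]
    for x in range(dim):
        bi, bj = (x >> (n - i)) & 1, (x >> (n - j)) & 1
        y = x ^ ((1 << (n - i)) | (1 << (n - j))) if bi != bj else x
        M[y][x] = 1
    return M

def gate_in_context(Ug, n, j, k):
    """Equation 17: S_{1,j} S_{2,k} (Ug (x) I_{n-2}) S_{2,k} S_{1,j}."""
    S1j, S2k = swap_op(n, 1, j), swap_op(n, 2, k)
    padded = kron(Ug, identity(2 ** (n - 2)))
    return matmul(matmul(matmul(matmul(S1j, S2k), padded), S2k), S1j)

CNOT = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]
# CNOT with control on wire 1 and target on wire 3 of a 3-qubit circuit.
U_ctx = gate_in_context(CNOT, 3, 1, 3)
```

For this example the resulting $8 \times 8$ matrix flips the last bit of the index exactly when the first bit is 1, which is the expected action of the gate on non-contiguous wires.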

Figure 2: Obtaining an expression for the $N$-ary circuit operator $U^{(g)}$.

As for Step 2, the isomorphism $h$ readily suggests a method for substituting each of the $s$
levels of the original quantum circuit $C$. Let $\mathcal{H}$ be the $2^d$-dimensional complex Hilbert space
on which $U_g$ acts, and let $\mathcal{H}'$ be the $2^{d+1}$-dimensional real Hilbert space on which its image
$O_g = h(U_g)$ acts. If $g$ is a $d$-qubit gate, then $O_g$ operates on $d + 1$ rebits. We thus have an
extra wire, and it is not a priori clear how to map the original $d$ quantum wires to these
$d + 1$ real wires. To resolve this ambiguity, we need to define how we associate the base vectors
of $\mathcal{H}$ with those of $\mathcal{H}'$.

^6 In fact, what we are given are finite-precision approximations of these matrices.

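We use the columns of the tensor $T$ defining $h$ in Equation 10 to define the following mappings
between $\mathcal{H}$ and $\mathcal{H}'$. Let $|\Phi\rangle$ be an arbitrary state vector in $\mathcal{H}$, and let $T = [T_0 \,|\, T_1]$,

$$|\Phi\rangle \xrightarrow{\;h_0\;} |\Phi_0\rangle \triangleq T_0 \otimes |\Phi\rangle = \begin{pmatrix} \mathrm{Re} \\ -\mathrm{Im} \end{pmatrix} \otimes |\Phi\rangle \tag{18}$$
$$|\Phi\rangle \xrightarrow{\;h_1\;} |\Phi_1\rangle \triangleq T_1 \otimes |\Phi\rangle = \begin{pmatrix} \mathrm{Im} \\ \mathrm{Re} \end{pmatrix} \otimes |\Phi\rangle \tag{19}$$

Note that the images $|\Phi_0\rangle$ and $|\Phi_1\rangle$ are mutually orthogonal in $\mathcal{H}'$. In addition, both $h_0$ and
$h_1$ are proper linear homomorphisms, as can be easily verified given the distributivity of the
tensor product with matrix addition.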

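The base vectors $|b\rangle$ of $\mathcal{H}$ are column vectors with all zero entries, except for a 1 at the
integer value $j$ of $b$; i.e. $\langle j|b\rangle = 1$, and $\langle k|b\rangle = 0$ for $k \ne j$. Thus, it is easy to see what these basis
vectors are mapped to:

$$|b\rangle \xrightarrow{\;h_0\;} |b_0\rangle = |0\rangle \otimes |b\rangle \tag{20}$$
$$|b\rangle \xrightarrow{\;h_1\;} |b_1\rangle = |1\rangle \otimes |b\rangle \tag{21}$$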
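The two encodings of Equations 18 and 19 amount to stacking real and imaginary parts. A minimal sketch follows, assuming state vectors are plain Python lists and that the extra "top" rebit is the most significant bit of the index; the function names `h0` and `h1` mirror the homomorphisms but the code itself is only illustrative.

```python
def h0(state):
    """|Phi> -> T_0 (x) |Phi> = (Re Phi, -Im Phi), Equation 18."""
    return ([complex(z).real for z in state] +
            [-complex(z).imag for z in state])

def h1(state):
    """|Phi> -> T_1 (x) |Phi> = (Im Phi, Re Phi), Equation 19."""
    return ([complex(z).imag for z in state] +
            [complex(z).real for z in state])

# A 1-qubit state with a complex amplitude: 0.6|0> + 0.8i|1>.
phi = [0.6, 0.8j]
phi0, phi1 = h0(phi), h1(phi)
inner = sum(x * y for x, y in zip(phi0, phi1))   # the two images are orthogonal

# A basis vector |1> is sent to |0>|1> and |1>|1> respectively (Equations 20, 21).
b0, b1 = h0([0, 1]), h1([0, 1])
```

For basis vectors the imaginary part vanishes, so the images are again basis vectors with the extra top rebit set to 0 or 1.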



These homomorphisms define the semantics to give to each of the $d + 1$ real wires on which
$O_g$ acts, as is shown in Figure 3. When the original quantum gate takes $|b\rangle$ as input, the
corresponding real gate $O_g$ has two possible base vectors, $|b_0\rangle$ or $|b_1\rangle$, as inputs. This corresponds
to having an extra wire at the top of the gate with value $|0\rangle$ or $|1\rangle$ respectively, and the base
state $|b\rangle$ in the bottom $d$ wires. Finally, note that since $U_g$ is represented as a matrix of constant
dimension, $O_g$ is also a small matrix, which can be computed from $U_g$ and written down
in constant time.
Figure 3: Simulation of an individual elementary binary quantum gate by a ternary real gate.
Note that in general, for non-classical inputs, the final state of $O_g$ cannot be factored as in
the example shown.



Even though we have defined how to simulate "out-of-context" $d$-ary elementary quantum gates,
we have not yet explained how to simulate them in their corresponding positions in the circuit
$C$. In other words, we still have to describe how to simulate the $N$-ary operators $U^{(g)}$. Again,
the isomorphism $h$ comes to the rescue: we will simulate $U^{(g)}$ by finding an $(n+1)$-rebit circuit
that computes its image $O^{(g)} = h(U^{(g)})$ under $h$. Unfortunately, we cannot simply construct
this circuit from the matrix definition of $h(U^{(g)})$, because it is a huge matrix and that would
require exponential time. However, $U^{(g)}$ is a very simple $N$-ary operator: it is after all just a
$d$-ary gate, which has a succinct description given by Equation 17. Since it involves at most $d$
qubits, the circuit $O^{(g)}$ only needs to involve those same wires and one other extra rebit.

At this point, we have to make a further apparently arbitrary choice, i.e. which one of the
$n - d$ other available wires will play the role of the "top" rebit for the $O_g$ gate? In other
words, where shall we place the extra wire required for implementing $O^{(g)}$? The answer comes
from the homomorphisms $h_0$ and $h_1$ in Equations (18) and (19), respectively. They are also
automatically defined on the state space of the whole circuit, and hence they generate the
same wire semantics as for isolated $d$-ary gates: the extra wire must be at the top of the circuit,
as is shown in Figure 4. Similarly to Equation 17, we have for the case where $d = 2$ an
expression for $O^{(g)}$ in terms of $O_g$,

$$\begin{aligned}
O^{(g)}(j,k) &= S_{2,j+1}\, S_{3,k+1}\, (O_g \otimes I_{n-2})\, S_{3,k+1}\, S_{2,j+1} \\
&\triangleq S'_g\, (O_g \otimes I_{n-2})\, S'_g
\end{aligned} \tag{22}$$

where again we define $S'_g$ for convenience, and $j$ and $k$ are the indices of the wires on which
gate $g$ acts in the original circuit $C$.

We now have a simple and well defined scheme for constructing the desired simulating circuit
$C'$. In Step 3, we will construct $C'$ by concatenating the real circuits for the $N$-ary operators
$O^{(g)}$. One important characteristic of this scheme is that we are reusing the extra wire needed
for each gate, each time using the same top wire. This is illustrated in Figure 5. Even though
they act on the whole space, the $O^{(g)}$ operators are simply $(d + 1)$-ary gates put in context,
and they can be described in a succinct manner requiring only a constant number of symbols.
Therefore, the overall size of the description for $C'$ will be linear in the size of the initial
description of $C$ which was given as input.
Figure 4: Obtaining an expression for the $(N + 1)$-ary circuit $O^{(g)}$.

Figure 5: Simulation of a quantum circuit by a real circuit.



What is remarkable about this scheme is that, despite its simplicity, it gives precisely what
we wanted, namely that the final operator $O_C$ be in some sense as similar as possible to the
operator $U_C$ of the original circuit. In fact, we have the following third nice property of our
simulation.

Lemma 3. The inverse image of $O_C$ is precisely $U_C$, i.e. $O_C = h(U_C)$.

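Proof. Because of the serialisation of Step 1, we have that $U_C = U^{(s)} \cdots U^{(2)} U^{(1)}$. We use this
and the group isomorphism properties of $h$ from Lemma 1 to obtain the following expression
for its image

$$\begin{aligned}
h(U_C) &= h\big(U^{(s)} \cdots U^{(1)}\big) \\
&= h(U^{(s)}) \cdots h(U^{(1)}) \\
&= \prod_{i=0,\; g=s-i}^{s-1} h(U^{(g)})
\end{aligned}$$

We can now use the expression of Equation 17 to substitute for $U^{(g)}$,

$$\begin{aligned}
&= \prod h\big(S_g (U_g \otimes I_{n-2}) S_g\big) \\
&= \prod h(S_g)\, h(U_g \otimes I_{n-2})\, h(S_g)
\end{aligned}$$

Since $S_g$ is composed only of 0's and 1's, we have that $\mathrm{Re}(S_g) = S_g$ and $\mathrm{Im}(S_g) = 0$.
Furthermore, we have that $S'_g = I_1 \otimes S_g$ from their definition in Equations (17) and (22), and
thus,

$$\begin{aligned}
&= \prod (I_1 \otimes S_g)\, h(U_g \otimes I_{n-2})\, (I_1 \otimes S_g) \\
&= \prod S'_g\, h(U_g \otimes I_{n-2})\, S'_g
\end{aligned}$$

However, the tensor product is just a formal operation, and its associativity property holds
even with a tensor of operators like $T$. Hence, we have

$$\begin{aligned}
&= \prod S'_g\, [T \otimes (U_g \otimes I_{n-2})]\, S'_g \\
&= \prod S'_g\, [(T \otimes U_g) \otimes I_{n-2}]\, S'_g \\
&= \prod S'_g\, [h(U_g) \otimes I_{n-2}]\, S'_g \\
&= \prod S'_g\, (O_g \otimes I_{n-2})\, S'_g
\end{aligned}$$

which with the padding expression of $O^{(g)}$ in Equation 22 finally gives

$$= \prod_{i=0,\; g=s-i}^{s-1} O^{(g)} = O_C. \tag{23}$$

□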



3.3.4 Circuit Initialisation and Measurement 

Having described how to construct the real circuit from the original circuit $C$, we still have
to address the issue of how to initialise the real circuit in Step 4, and furthermore of how to
interpret and use its measurements to simulate the initial quantum algorithm in Step 5.

Let $|\Psi\rangle$ represent the initial state given to $C$, and let $|\Phi\rangle$ be its image under
$U_C$, i.e. the final state of the circuit before measurement. If we think back to the two
homomorphisms $h_0$ and $h_1$ from the qubit state space to the rebit state space, induced by $h$,
we have two logical choices for initialising the corresponding real circuit $O_C$, the states
$|\Psi_0\rangle$ and $|\Psi_1\rangle$. Which should we choose, and in either case what will the output
look like? The answer to the latter question is given by the following lemma.

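Lemma 4. The images of $|\Psi_0\rangle$ and $|\Psi_1\rangle$ in the real circuit are

$$O_C|\Psi_0\rangle = T_0 \otimes |\Phi\rangle = |\Phi_0\rangle \tag{24}$$
$$O_C|\Psi_1\rangle = T_1 \otimes |\Phi\rangle = |\Phi_1\rangle \tag{25}$$

Proof. As in the proof of Lemma 1, all we require are the matrix multiplication rules of
Equation 12

$$
\begin{aligned}
O_C|\Psi_0\rangle &= (T \otimes U_C)(T_0 \otimes |\Psi\rangle) \\
&= \begin{pmatrix} \mathrm{Re}(U_C) & \mathrm{Im}(U_C) \\ -\mathrm{Im}(U_C) & \mathrm{Re}(U_C) \end{pmatrix}
   \begin{pmatrix} \mathrm{Re}(|\Psi\rangle) \\ -\mathrm{Im}(|\Psi\rangle) \end{pmatrix} \\
&= \begin{pmatrix} \mathrm{Re}(U_C)\,\mathrm{Re}(|\Psi\rangle) - \mathrm{Im}(U_C)\,\mathrm{Im}(|\Psi\rangle) \\
   -\big[\mathrm{Im}(U_C)\,\mathrm{Re}(|\Psi\rangle) + \mathrm{Re}(U_C)\,\mathrm{Im}(|\Psi\rangle)\big] \end{pmatrix} \\
&= \begin{pmatrix} \mathrm{Re}(U_C|\Psi\rangle) \\ -\mathrm{Im}(U_C|\Psi\rangle) \end{pmatrix} \\
&= T_0 \otimes (U_C|\Psi\rangle) \\
&= T_0 \otimes |\Phi\rangle = |\Phi_0\rangle
\end{aligned}
\tag{26}
$$

With the same method, we can obtain a similar expression for $|\Phi_1\rangle$, i.e.

$$
\begin{aligned}
O_C|\Psi_1\rangle &= (T \otimes U_C)(T_1 \otimes |\Psi\rangle) \\
&= \begin{pmatrix} \mathrm{Re}(U_C) & \mathrm{Im}(U_C) \\ -\mathrm{Im}(U_C) & \mathrm{Re}(U_C) \end{pmatrix}
   \begin{pmatrix} \mathrm{Im}(|\Psi\rangle) \\ \mathrm{Re}(|\Psi\rangle) \end{pmatrix} \\
&= T_1 \otimes |\Phi\rangle = |\Phi_1\rangle
\end{aligned}
\tag{27}
$$

□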



Let us assume for a moment (and in fact, this is without loss of generality) that the original
circuit was to be initialised with some base vector $|x\rangle$, with a final state
$|\Phi\rangle = U_C|x\rangle$. Again, there are two possible choices for initialising the
corresponding real circuit, namely $|x_0\rangle = |0\rangle|x\rangle$ and $|x_1\rangle = |1\rangle|x\rangle$.
What would then be the output of the simulated circuit in either case? In the very special case
that $|\Phi\rangle$ is also a base vector, we would have $|\Phi_0\rangle = |0\rangle|\Phi\rangle$ and
$|\Phi_1\rangle = |1\rangle|\Phi\rangle$, and thus, in either case, the bottom $n$ wires would contain
the right answer and we can ignore the top wire. But when $|\Phi\rangle$ is some arbitrary pure
state, neither purely real nor purely imaginary, we cannot give such nice semantics to the top
wire. In particular, it might be entangled with the rest of the wires, and hence we cannot
factor the final state.

Nonetheless, what is surprising is that if we trace out the top wire, in all cases we will get the 
same statistics and furthermore that we will obtain the right statistics, i.e. the same as if we 
had used the original quantum circuit C. More formally, we have 

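Lemma 5. Let $|\Phi\rangle$ be an arbitrary $n$-qubit pure state, and let
$\rho_0 = \mathrm{Tr}_1 |\Phi_0\rangle\langle\Phi_0|$ and $\rho_1 = \mathrm{Tr}_1 |\Phi_1\rangle\langle\Phi_1|$
represent the partial traces obtained by tracing out (i.e. forgetting about) the top wire.
Then we have that

$$\rho_0 = \rho_1, \tag{28}$$
$$\mathrm{Diag}(\rho_0) = \mathrm{Diag}(\rho_1) = \mathrm{Diag}\big(|\Phi\rangle\langle\Phi|\big). \tag{29}$$

Proof. The partial trace of the first wire of an arbitrary density operator given in block matrix
form

$$\rho = \begin{pmatrix} A & B \\ C & D \end{pmatrix}$$

is given by,

$$\mathrm{Tr}_1(\rho) = [I_n|0]\,\rho\,[I_n|0]^{T} + [0|I_n]\,\rho\,[0|I_n]^{T} = A + D \tag{30}$$

In particular, we have that

$$|\Phi_0\rangle\langle\Phi_0| = (T_0 \otimes |\Phi\rangle)\,(T_0 \otimes |\Phi\rangle)^{T}$$

which by applying transposition rules for block matrices and Equation 12 gives

$$
\begin{aligned}
|\Phi_0\rangle\langle\Phi_0|
&= \begin{pmatrix} \mathrm{Re}(|\Phi\rangle) \\ -\mathrm{Im}(|\Phi\rangle) \end{pmatrix}
   \begin{pmatrix} \mathrm{Re}(\langle\Phi|) & \mathrm{Im}(\langle\Phi|) \end{pmatrix} \\
&= \begin{pmatrix}
   \mathrm{Re}(|\Phi\rangle)\,\mathrm{Re}(\langle\Phi|) & \mathrm{Re}(|\Phi\rangle)\,\mathrm{Im}(\langle\Phi|) \\
   -\mathrm{Im}(|\Phi\rangle)\,\mathrm{Re}(\langle\Phi|) & -\mathrm{Im}(|\Phi\rangle)\,\mathrm{Im}(\langle\Phi|)
   \end{pmatrix}
\end{aligned}
\tag{31}
$$

and similarly for

$$
\begin{aligned}
|\Phi_1\rangle\langle\Phi_1| &= (T_1 \otimes |\Phi\rangle)\,(T_1 \otimes |\Phi\rangle)^{T} \\
&= \begin{pmatrix}
   -\mathrm{Im}(|\Phi\rangle)\,\mathrm{Im}(\langle\Phi|) & \mathrm{Im}(|\Phi\rangle)\,\mathrm{Re}(\langle\Phi|) \\
   -\mathrm{Re}(|\Phi\rangle)\,\mathrm{Im}(\langle\Phi|) & \mathrm{Re}(|\Phi\rangle)\,\mathrm{Re}(\langle\Phi|)
   \end{pmatrix}
\end{aligned}
\tag{32}
$$

By symmetry, we thus have the same expression for both partial traces

$$
\begin{aligned}
\rho_0 = \rho_1 &= \mathrm{Re}(|\Phi\rangle)\,\mathrm{Re}(\langle\Phi|) - \mathrm{Im}(|\Phi\rangle)\,\mathrm{Im}(\langle\Phi|) \\
&= \mathrm{Re}\big(|\Phi\rangle\langle\Phi|\big)
\end{aligned}
\tag{33}
$$

Since $|\Phi\rangle\langle\Phi|$ is hermitian, its diagonal entries are all real, and therefore it has
the same diagonal entries as $\rho_0$ and $\rho_1$. □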

In other words, combining this with Lemma 4, we arrive at the conclusion that it does not
matter what we set as the initial value of the top wire, $|0\rangle$ or $|1\rangle$. Furthermore, it
is easy to verify that any 1-rebit state will do, whether pure or even totally mixed, as long as
it is unentangled and uncorrelated with the bottom wires.
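
As a numerical sanity check of Lemmas 4 and 5, the following minimal sketch builds the real image of a random unitary and compares the statistics of the bottom wires with those of the original complex evolution. It assumes NumPy; the helper names `h_real`, `encode` and `bottom_statistics`, the seed and the two-qubit size are illustrative choices and not part of the construction above.

```python
import numpy as np

def h_real(U):
    """Real image T (x) U = [[Re U, Im U], [-Im U, Re U]] of a complex matrix U."""
    return np.block([[U.real, U.imag], [-U.imag, U.real]])

def encode(psi, top=0):
    """Rebit encodings T_0 (x) |psi> (top=0) and T_1 (x) |psi> (top=1)."""
    if top == 0:
        return np.concatenate([psi.real, -psi.imag])
    return np.concatenate([psi.imag, psi.real])

def bottom_statistics(phi_real):
    """Outcome probabilities of the bottom wires when the top wire is ignored."""
    half = phi_real.size // 2
    return phi_real[:half] ** 2 + phi_real[half:] ** 2

rng = np.random.default_rng(0)   # arbitrary seed (assumption)
N = 4                            # two qubits (assumption)
# Random unitary obtained from the QR decomposition of a random complex matrix.
U, _ = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))

psi = np.zeros(N, dtype=complex)
psi[2] = 1.0                     # base vector |10>
phi = U @ psi                    # final state of the original circuit

out0 = h_real(U) @ encode(psi, 0)   # top wire initialised to |0>
out1 = h_real(U) @ encode(psi, 1)   # top wire initialised to |1>
```

Under these assumptions both `out0` and `out1` reproduce the encodings of `phi`, and `bottom_statistics` returns the same distribution as `np.abs(phi) ** 2` in either case.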



3.4 Further Considerations and Consequences 
3.4.1 Complexity of simulation 

In general, if we initially have a $d$-qubit gate, the new gate will be a $(d+1)$-rebit gate.
However, if $U_g$ contains only real entries, then $O_g = I \otimes U_g$, which means that in this
particular case the top rebit need not be involved, and therefore the new gate is the same as
the original. If the whole quantum circuit we are given is constructed with such real gates,
then we are in luck and we do not require the extra rebit at all. In the general complex case,
however, the circuit width is at most one more than that of the original circuit.

However, one non-negligible consequence of our simulation is that any parallelism that the
original circuit may have had is lost after we serialise the circuit in Step 1 of the simulation
algorithm. While it might still be possible to parallelise parts of the real circuit (e.g. where
we had real gates in $C$), in the worst case, if all gates in $C$ require complex amplitudes,
then the top wire is always used and the circuit depth of the real circuit is equal to its gate
count $s$. This is a consequence of our decision to reuse the same wire as the "top wire" for
each gate. However, it is possible to reduce this depth increase at the cost of using several
"top wires" and re-combining them towards the end of the circuit. This will result in only an
$O(\log s)$ increase in circuit depth.

Finally, as we have mentioned before, the overall classical pre- and post-processing requires
little computational effort. Converting a description of the original circuit $C$ into one of the
real circuit requires time linear in the size of the circuit description, i.e. $O(s)$.
Post-processing will be exactly the same as for the original quantum algorithm, since the
statistics of measuring the bottom wires of the real circuit (or any subset thereof) will be
exactly the same as those of measuring the wires of $C$, as per Lemma 5.

3.4.2 Universality 

We knew already, from the previous results mentioned in Section 3.2, that it is possible to
express any quantum circuit in terms of real gates only. If we had not known already that
fact, we could have presumed that quantum circuits would be described and given to us in
terms of some universal set of gates containing at least one non-real, complex gate. In that
case, Theorem 2 would provide a proof that a real universal set could be constructed, simply by
replacing any non-real gate by its image under $h$.

One advantage of this technique is that it does this conversion with very limited overhead in
terms of width, requiring 1 extra rebit for the whole circuit, and not an extra rebit for every
substituted gate, as might have been expected. In addition to its usefulness in Section 4, this
is one of the reasons that we believe that this particular version of the equivalence theorem is
interesting in its own right, when compared to previously known results. In particular, the fact
that it provides a much tighter bound on the simulation resources needed might prove useful in
the study of lower quantum complexity classes and possibly in quantum information theory.

3.4.3 Interpretation 

With Lemma 5, we are left with a curious paradox: while we require an extra rebit to perform 
the simulation, we do not care about its initial or its final value. In particular, it can be 
anything, even the maximally mixed state. So, what is this rebit doing? 

Let $H_0$ and $H_1$ be the orthogonal subspaces, each of dimension $N$, spanned by the
$|b_0\rangle$ and $|b_1\rangle$ base vectors of Equations 18 and 19, respectively. If a state
$|\Phi\rangle$ has only real amplitudes then $|\Phi_0\rangle \in H_0$ and $|\Phi_1\rangle \in H_1$. For a
generic $|\Phi\rangle$, however, $|\Phi_0\rangle$ and $|\Phi_1\rangle$ are not contained in either
subspace, but in the space spanned by both, i.e. the complete rebit space. In that case, the
top rebit will not be just $|0\rangle$ or $|1\rangle$ but some superposition thereof.

In other words, it somehow keeps track of the phase (angle) of the representation of
$|\Phi\rangle$ in rebit space with respect to these subspaces. The CNOT gate (or any other real
gate) does not change this phase factor. However, as arbitrary gates with complex transition
amplitudes affect this phase factor, their effect is simulated by "recording" this change in the
top rebit. How we initialise the top rebit gives an arbitrary initial phase to the representation
of $|\Phi\rangle$, but as we saw, this initial phase does not affect the statistics of the bottom
wires, and thus can be set to any value. However, how this phase has been changed by previous
complex gates will affect the bottom rebits in subsequent complex gates, in a similar fashion to
the phase kickback phenomenon in many quantum algorithms^. That is why that top rebit is needed.

4 Quaternionic Computing 

This section closely mimics Section 3. First we define what we mean by quaternionic computing, 
making sure that it is a sensible model. We then prove an equivalence theorem with quantum 
computing, by using the same techniques as those of Theorem 2. 

4.1 Definitions 
4.1.1 Quaternions 

Quaternions were invented by the Irish mathematician William Rowan Hamilton in 1843, as a
generalisation of complex numbers. They form a non-commutative, associative division algebra.
A quaternion is defined as

$$\hat{\alpha} = a_0 + a_1 i + a_2 j + a_3 k \tag{34}$$

where the coefficients $a_0, \ldots, a_3$ are real numbers and $i$, $j$, and $k$ obey the equations

$$ii = jj = kk = ijk = -1 \tag{35}$$

Multiplication of quaternions is defined by formally multiplying two expressions from
Equation 34, and recombining the cross terms by using Equation 35. It is very important to note
that while all non-zero quaternions have multiplicative inverses, their multiplication is not
commutative^. Thus, they form what is called a division algebra, sometimes also called a skew
field.

The quaternion conjugation operation is defined as follows:

$$\hat{\alpha}^{\star} = a_0 - a_1 i - a_2 j - a_3 k \tag{36}$$

where for clarity, we represent it with the (non-standard) symbol ($\star$) in order to
distinguish it from complex conjugation, represented with ($*$). With this conjugation rule, we
can define the modulus of a quaternion as

$$|\hat{\alpha}| = \sqrt{\hat{\alpha}\hat{\alpha}^{\star}} = \sqrt{a_0^2 + a_1^2 + a_2^2 + a_3^2} \tag{37}$$

^With the noticeable difference that phase kickback would not work if the top qubit were maximally mixed...
^While the square roots of $-1$ are anti-commutative, e.g. $ij = -ji$, this is not true in general, i.e. $\hat{\alpha}\hat{\beta} \neq -\hat{\beta}\hat{\alpha}$.

Furthermore, the usual vector inner product has the required properties (i.e. it is norm
defining), and a proper Hilbert space can be defined on any quaternionic linear space.

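It is also possible to complexify the quaternions, that is, to represent them in terms of complex
numbers only. Let $\hat{\alpha}$ be an arbitrary quaternion, then we define its complex and weird
parts as

$$\mathrm{Co}(\hat{\alpha}) = a_0 + a_1 i \tag{38}$$
$$\mathrm{Wd}(\hat{\alpha}) = a_2 + a_3 i. \tag{39}$$

We can then decompose $\hat{\alpha}$ into its complex and weird parts as follows:

$$
\begin{aligned}
\hat{\alpha} &= a_0 + a_1 i + a_2 j + a_3 k \\
             &= (a_0 + a_1 i) + (a_2 + a_3 i)\,j \\
             &= \mathrm{Co}(\hat{\alpha}) + \mathrm{Wd}(\hat{\alpha})\,j
\end{aligned}
\tag{40}
$$

This equation, together with the fact that $jz = z^{*}j$ for any complex number $z$, allows us to
derive multiplication rules similar to those of Equation 11

$$
\begin{aligned}
\mathrm{Co}(\hat{\alpha}\hat{\beta}) &= \mathrm{Co}(\hat{\alpha})\,\mathrm{Co}(\hat{\beta}) - \mathrm{Wd}(\hat{\alpha})\,\mathrm{Wd}^{*}(\hat{\beta}) \\
\mathrm{Wd}(\hat{\alpha}\hat{\beta}) &= \mathrm{Co}(\hat{\alpha})\,\mathrm{Wd}(\hat{\beta}) + \mathrm{Wd}(\hat{\alpha})\,\mathrm{Co}^{*}(\hat{\beta})
\end{aligned}
\tag{41}
$$

where we define $\mathrm{Co}^{*}(\hat{\alpha}) = [\mathrm{Co}(\hat{\alpha})]^{*}$, and similarly for the
weird part $\mathrm{Wd}^{*}(\hat{\alpha}) = [\mathrm{Wd}(\hat{\alpha})]^{*}$. It is interesting to note
how the non-commutativity of quaternions is made apparent by the fact that neither identity in
Equation 41 is symmetric with respect to $\hat{\alpha}$ and $\hat{\beta}$, unlike their equivalent
for complex numbers (Equation 11), because in general
$\mathrm{Co}^{*}(\hat{\alpha}) \neq \mathrm{Co}(\hat{\alpha})$ and
$\mathrm{Wd}^{*}(\hat{\alpha}) \neq \mathrm{Wd}(\hat{\alpha})$. We can also rewrite Equation 37 for
the modulus as

$$|\hat{\alpha}| = \sqrt{|\mathrm{Co}(\hat{\alpha})|^2 + |\mathrm{Wd}(\hat{\alpha})|^2} \tag{42}$$

which is very similar to the modulus definition for complex numbers. Finally, we have the
following useful identities

$$
\begin{aligned}
\mathrm{Co}(\hat{\alpha}^{\star}) &= \mathrm{Co}^{*}(\hat{\alpha}) \\
\mathrm{Wd}(\hat{\alpha}^{\star}) &= -\mathrm{Wd}(\hat{\alpha})
\end{aligned}
\tag{43}
$$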
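
The multiplication rules of Equation 41 can be checked against the Hamilton product obtained directly from Equation 35. The following is a minimal sketch using only the Python standard library; the class `Quaternion` and the helper `mul_via_parts` are illustrative names introduced here, not notation from the text.

```python
from dataclasses import dataclass
import math

@dataclass(frozen=True)
class Quaternion:
    a0: float
    a1: float
    a2: float
    a3: float

    def co(self):
        """Complex part Co = a0 + a1 i (Equation 38)."""
        return complex(self.a0, self.a1)

    def wd(self):
        """Weird part Wd = a2 + a3 i (Equation 39)."""
        return complex(self.a2, self.a3)

    def conj(self):
        """Quaternion conjugate (Equation 36)."""
        return Quaternion(self.a0, -self.a1, -self.a2, -self.a3)

    def modulus(self):
        """Modulus (Equation 37)."""
        return math.sqrt(self.a0**2 + self.a1**2 + self.a2**2 + self.a3**2)

    def __mul__(self, o):
        """Hamilton product, expanded term by term with ii = jj = kk = ijk = -1."""
        return Quaternion(
            self.a0 * o.a0 - self.a1 * o.a1 - self.a2 * o.a2 - self.a3 * o.a3,
            self.a0 * o.a1 + self.a1 * o.a0 + self.a2 * o.a3 - self.a3 * o.a2,
            self.a0 * o.a2 - self.a1 * o.a3 + self.a2 * o.a0 + self.a3 * o.a1,
            self.a0 * o.a3 + self.a1 * o.a2 - self.a2 * o.a1 + self.a3 * o.a0,
        )

def mul_via_parts(a, b):
    """Product computed from the complex and weird parts (Equation 41)."""
    co = a.co() * b.co() - a.wd() * b.wd().conjugate()
    wd = a.co() * b.wd() + a.wd() * b.co().conjugate()
    return Quaternion(co.real, co.imag, wd.real, wd.imag)

i = Quaternion(0, 1, 0, 0)
j = Quaternion(0, 0, 1, 0)
k = Quaternion(0, 0, 0, 1)
a = Quaternion(1, 2, 3, 4)
b = Quaternion(5, 6, 7, 8)
```

With these definitions `i * j` equals `k` while `j * i` equals its negative, and `mul_via_parts(a, b)` agrees with `a * b` but not with `b * a`, which is the asymmetry noted above.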



4.1.2 Quaterbits 

Similarly as in quantum information theory, we can define the quaternionic equivalent of the
qubit, as the most elementary quaternionic information system, the quaterbit.^

Definition 3 (Quaterbit). A quaterbit is a 2-level system with quaternionic amplitudes. It
can be represented by a unit vector $|\Phi\rangle$ in a 2-dimensional quaternionic Hilbert space,
i.e.

$$|\Phi\rangle = \hat{\alpha}|0\rangle + \hat{\beta}|1\rangle, \quad \text{s.t. } \|\Phi\|_2 = \sqrt{|\hat{\alpha}|^2 + |\hat{\beta}|^2} = 1 \tag{44}$$

up to an arbitrary quaternionic phase factor. Indeed, we have that

$$\Phi = \Phi' \iff |\Phi\rangle = \hat{\eta}\,|\Phi'\rangle, \text{ where } |\hat{\eta}| = 1. \tag{45}$$

^The name "quits" has also been suggested [17] and abandoned...

The canonical values of the quaterbit correspond to the canonical basis $|0\rangle$ and $|1\rangle$ of
that vector space, and are given the same semantics as before. Similarly, we can define
$n$-quaterbit states, with the same canonical basis as for rebits and qubits. With this
definition, the measurement rule in Equation 1 is still sound and we adopt it axiomatically.

Quaternions are often used in computer graphics to represent rotations of the 3D Euclidean 
space. However, contrary to rebits or qubits, we have not found a nice geometric interpretation 
for the state space of even a single quaterbit. 



4.1.3 Quaternionic Circuits 

For the sake of clarity, let us distinguish the conjugate transpose operation for quaternion and
complex matrices by representing them differently with the ($\ddagger$) and ($\dagger$) symbols,
respectively. As before, the only relevant linear transformations $Q$ that preserve the $\ell_2$
norm on this vector space are the quaternionic unitary transformations, which have the same
property $Q^{\ddagger} = Q^{-1}$ as complex unitary transformations. They form the so-called
symplectic group, which is represented as Sp($N$).

Thus armed with linear, inner-product preserving operations, we can in principle define quater- 
nionic circuits in a similar fashion as we defined quantum and real circuits. Unfortunately, we 
cannot apply the same definition of computation semantics as before, and thus cannot define 
quaternionic algorithms in the same way either. The reason is simple and quite surprising: the 
output of a quaternionic circuit is not uniquely defined! 

To see this, consider the following property of the matrix tensor product, i.e. the distributivity
of the tensor product with the regular matrix product.

$$(A \otimes B) \cdot (C \otimes D) = (A \cdot C) \otimes (B \cdot D) \tag{46}$$

where $A$, $B$, $C$, $D$ are arbitrary matrices. This equation is in general true for any
commutative semiring and for non-commutative semirings only if $C$ and $D$ are 0-1 matrices.



Figure 6: Effects of quaternionic non-commutativity on quaternionic circuits. The operator for
the circuit on the left is obtained by combining the operators "vertically", by taking the tensor
product first; this corresponds to the operator on the left side of Equation 46. The operator for
the right circuit is obtained by combining them "horizontally" first, and gives the expression
on the right hand side of the same equation.

Suppose now that the matrices $A$, $B$, $C$, $D$ correspond to the gate transformations in the
circuit depicted in Figure 6. Then, the fact that Equation 46 does not hold means that the two
different ways shown there of combining the gates will yield different operators for the circuit.
Furthermore, even if we initialise in both cases with the same input, we will obtain different
output statistics.
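
The failure of Equation 46 over the quaternions can be observed on a small example. The sketch below is illustrative only: it assumes NumPy, represents a quaternionic matrix as a pair (Co, Wd) of complex matrices, and uses the hypothetical helpers `qmatmul` and `qkron`, which apply the rules of Equation 41 entry by entry. The gates chosen here, $B = \mathrm{diag}(1, i)$ on one wire, $C = \mathrm{diag}(1, j)$ on the other and $A = D = I$, are an assumption made for the example.

```python
import numpy as np

def qmatmul(A, B):
    """Product of quaternionic matrices given as (Co, Wd) pairs (Equation 41)."""
    (ca, wa), (cb, wb) = A, B
    return ca @ cb - wa @ wb.conj(), ca @ wb + wa @ cb.conj()

def qkron(A, B):
    """Tensor product of quaternionic matrices, entries multiplied as a * b."""
    (ca, wa), (cb, wb) = A, B
    return (np.kron(ca, cb) - np.kron(wa, wb.conj()),
            np.kron(ca, wb) + np.kron(wa, cb.conj()))

zero = np.zeros((2, 2), dtype=complex)
I = (np.eye(2, dtype=complex), zero)
B = (np.diag([1, 1j]), zero)                       # diag(1, i)
C = (np.diag([1, 0]).astype(complex),
     np.diag([0, 1]).astype(complex))              # diag(1, j)
A, D = I, I

lhs = qmatmul(qkron(A, B), qkron(C, D))            # left side of Equation 46
rhs = qkron(qmatmul(A, C), qmatmul(B, D))          # right side of Equation 46
```

In this example the two operators are diag(1, i, j, k) and diag(1, i, j, -k): they agree on the first three base vectors and differ by a relative sign on the last one, so they are not equal up to a global phase.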

To further illustrate this paradox it is useful to think of the states of a circuit in terms of 
temporal cuts in the circuit graph (see [10] for a more detailed description of this formalism). 
We can think of the set of all possible states of a given circuit graph as its discrete "space-time 
continuum." The circuit topology defines an ordering on this set that is naturally associated 
with a state being "before" or "after" another. It is however only partially ordered as some 
states are temporally incomparable, i.e. those corresponding to cuts of the graph that cross each 
other. Each topological sort of the circuit graph is one of the many possible total orderings 
of the set of cuts, or in other words a chain in the poset (partially-ordered set) of cuts, also 
corresponding, as we saw in Section 3.3, to an evaluation sequence of gates. In more physical 
terms, each of these chains or total orders corresponds to a possible path in the space-time 
continuum of the circuit. 

When Equation 46 holds, we are guaranteed that the overall operator over each and all of these 
paths will be the same. However, in the case of the quaternionic circuits, we can expect each of 
these paths to give a different answer. Which of these many paths (for a poly-size circuit, there 
are exponentially many of them) is the "correct" one? Which one is somehow privileged by 
nature? Which one should we choose to be the "computational output" of the circuit? The fact 
is that we do not know how to resolve this ambiguity, and without doing it, it is not completely 
clear what "the" model of quaternionic computing should consist of. 

We can get out of this impasse by allowing for a "parametrised" notion of a quaternionic 
algorithm. 

Definition 4 (Output of a Quaternionic Circuit). Let $C$ be a quaternionic circuit of size
$s$ and let $\sigma = (\sigma_1, \ldots, \sigma_s)$ represent one of the possible topological sorts
of the corresponding circuit graph. We denote by $Q_\sigma$ the operator of the circuit $C$ under
$\sigma$, which is obtained when the gates are combined one-by-one following the ordering in
$\sigma$, i.e.

$$Q_\sigma = Q^{(\sigma_s)}\, Q^{(\sigma_{s-1})} \cdots Q^{(\sigma_1)}$$

where $Q^{(\sigma_i)}$ is the (in-context) operator corresponding to the $i$-th gate in $\sigma$.

Definition 5 (Quaternionic Algorithm). A quaternionic algorithm is defined as a classical
TM, which on (classical) input $x$ will generate a (classical) description of a quaternionic
circuit $C$ and a (classical) description of one of its possible topological sorts $\sigma$. The
result of measurement of the final state $|\Phi\rangle = Q_\sigma|\Psi_0\rangle$, where
$|\Psi_0\rangle$ is the default initial state, is then post-processed by the TM to produce its final
(classical) answer.

Relative to this somewhat unsatisfying notion of quaternionic computation, we are still able to
obtain the following equivalence result. This theorem is the main result of this article, and its
proof is very heavily inspired by that of Theorem 2.

Theorem 4. Let $C$ be any $n$-quaterbit circuit of size $s$, composed of gates of degree at most
$d$, and let $\sigma$ be any topological sort of $C$. Then, there exists a quantum circuit of
$n + 1$ qubits, employing the same number of gates, each of degree at most $d + 1$, that exactly
simulates the operator $Q_\sigma$.



4.2 Proof of Main Theorem 
4.2.1 More Group Theory 

As before, the proof is based on the (lesser known) fact that Sp($N$) can be embedded into
SU($2N$). We provide a mapping from one to the other, which is very similar to the one from
SU($N$) to SO($2N$).

The mapping $h$ from Sp($N$) to SU($2N$) is defined similarly to the one from SU($N$) to SO($2N$)
given in Equation 8

$$
Q \;\overset{h}{\longmapsto}\; U = h(Q) =
\begin{pmatrix} \mathrm{Co}(Q) & \mathrm{Wd}(Q) \\ -\mathrm{Wd}^{*}(Q) & \mathrm{Co}^{*}(Q) \end{pmatrix}
\tag{47}
$$

or equivalently in its tensor form, as in Equation 9

$$
h(Q) = \hat{T} \otimes Q, \qquad
\hat{T} = \begin{pmatrix} \mathrm{Co} & \mathrm{Wd} \\ -\mathrm{Wd}^{*} & \mathrm{Co}^{*} \end{pmatrix}
\tag{48}
$$

At this point, what we need to show is that this $h$ is also a group isomorphism, in other words
the equivalent of Theorem 3.

Theorem 5. Let $G_N$ represent the image of Sp($N$) under $h$. Then $h$ is a proper group
isomorphism between Sp($N$) and $G_N$, and $G_N$ is a subgroup of SU($2N$).

Thanks to the tensor formalism, we do not need to construct the proof in full detail, as we did 
for Theorem 3. The only thing we need to show are equivalent statements to those of Lemmas 
1 and 2. 

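Lemma 6. Let $A$ and $B$ be any two arbitrary $N \times N$ quaternion matrices, then
$h(AB) = h(A)\,h(B)$.

Proof. As before, it is simple to verify that the quaternion multiplication rules in Equation 41
also generalise to any multipliable quaternionic matrices $A$ and $B$. Thus we have that

$$
\begin{aligned}
h(A)\,h(B) &= (\hat{T} \otimes A)(\hat{T} \otimes B) \\
&= \begin{pmatrix} \mathrm{Co}(A) & \mathrm{Wd}(A) \\ -\mathrm{Wd}^{*}(A) & \mathrm{Co}^{*}(A) \end{pmatrix}
   \begin{pmatrix} \mathrm{Co}(B) & \mathrm{Wd}(B) \\ -\mathrm{Wd}^{*}(B) & \mathrm{Co}^{*}(B) \end{pmatrix} \\
&= \begin{pmatrix}
   \mathrm{Co}(A)\,\mathrm{Co}(B) - \mathrm{Wd}(A)\,\mathrm{Wd}^{*}(B) &
   \mathrm{Co}(A)\,\mathrm{Wd}(B) + \mathrm{Wd}(A)\,\mathrm{Co}^{*}(B) \\
   -\mathrm{Wd}^{*}(A)\,\mathrm{Co}(B) - \mathrm{Co}^{*}(A)\,\mathrm{Wd}^{*}(B) &
   -\mathrm{Wd}^{*}(A)\,\mathrm{Wd}(B) + \mathrm{Co}^{*}(A)\,\mathrm{Co}^{*}(B)
   \end{pmatrix} \\
&= \begin{pmatrix} \mathrm{Co} & \mathrm{Wd} \\ -\mathrm{Wd}^{*} & \mathrm{Co}^{*} \end{pmatrix} \otimes AB \\
&= \hat{T} \otimes AB = h(AB)
\end{aligned}
\tag{49}
$$

□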



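Lemma 7. Let $A$ be an arbitrary $N \times N$ quaternion matrix, then
$h(A^{\ddagger}) = h(A)^{\dagger}$.

Proof. Similarly to the proof of Lemma 2, we require the following matrix identities, which are
easily verified

$$
\begin{aligned}
\mathrm{Co}(A^{\ddagger}) &= \mathrm{Co}(A)^{\dagger} \\
\mathrm{Co}^{*}(A^{\ddagger}) &= \mathrm{Co}^{*}(A)^{\dagger} \\
\mathrm{Wd}(A^{\ddagger}) &= -\mathrm{Wd}^{*}(A)^{\dagger}
\end{aligned}
\tag{50}
$$

We then have that

$$
\begin{aligned}
h(A)^{\dagger}
&= \begin{pmatrix} \mathrm{Co}(A) & \mathrm{Wd}(A) \\ -\mathrm{Wd}^{*}(A) & \mathrm{Co}^{*}(A) \end{pmatrix}^{\dagger} \\
&= \begin{pmatrix} \mathrm{Co}(A)^{\dagger} & -\mathrm{Wd}^{*}(A)^{\dagger} \\ \mathrm{Wd}(A)^{\dagger} & \mathrm{Co}^{*}(A)^{\dagger} \end{pmatrix} \\
&= \begin{pmatrix} \mathrm{Co}(A^{\ddagger}) & \mathrm{Wd}(A^{\ddagger}) \\ -\mathrm{Wd}^{*}(A^{\ddagger}) & \mathrm{Co}^{*}(A^{\ddagger}) \end{pmatrix} \\
&= \hat{T} \otimes A^{\ddagger} = h(A^{\ddagger})
\end{aligned}
\tag{51}
$$

□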
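
The two properties just established can also be checked numerically. The following sketch assumes NumPy and represents a quaternionic matrix as a pair (Co, Wd) of complex matrices; the helper names `h`, `qmatmul`, `qdagger` and `random_qmatrix`, the seed, and the example gate $Q = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 & j \\ j & 1 \end{pmatrix}$ are illustrative assumptions rather than part of the proof.

```python
import numpy as np

def h(Q):
    """Complex image [[Co, Wd], [-Wd*, Co*]] of a quaternionic matrix (Co, Wd)."""
    co, wd = Q
    return np.block([[co, wd], [-wd.conj(), co.conj()]])

def qmatmul(A, B):
    """Quaternionic matrix product via the rules of Equation 41."""
    (ca, wa), (cb, wb) = A, B
    return ca @ cb - wa @ wb.conj(), ca @ wb + wa @ cb.conj()

def qdagger(A):
    """Quaternionic conjugate transpose: Co -> Co^dagger, Wd -> -Wd^T."""
    co, wd = A
    return co.conj().T, -wd.T

rng = np.random.default_rng(1)   # arbitrary seed (assumption)

def random_qmatrix(n):
    """Random (generally non-unitary) n x n quaternionic matrix."""
    co = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    wd = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return co, wd

A = random_qmatrix(2)
B = random_qmatrix(2)

# Example quaternionic unitary (1/sqrt 2) [[1, j], [j, 1]] and its image.
Q = (np.eye(2, dtype=complex) / np.sqrt(2),
     np.array([[0, 1], [1, 0]], dtype=complex) / np.sqrt(2))
U = h(Q)
```

For the random matrices `h(qmatmul(A, B))` matches `h(A) @ h(B)` and `h(qdagger(A))` matches `h(A).conj().T`, while the image `U` of the example gate is a 4 x 4 complex unitary of determinant 1.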



4.2.2 The Simulation Algorithm 

Let $C$ be a quaternionic circuit composed of $s$ elementary gates of at most $d$ quaterbits,
let $\sigma$ represent a path in its space-time continuum (i.e. one of its possible total
orderings or, equivalently, a topological sort of the circuit graph), and let $Q_\sigma$ be the
corresponding quaternionic linear operator. Then the quantum simulation algorithm for $C$ under
$\sigma$ will be very similar to that described in Section 3.3.

Step 1 Serialise the given circuit $C$ according to $\sigma$, i.e. such that
$Q_\sigma = Q^{(s)} Q^{(s-1)} \cdots Q^{(2)} Q^{(1)}$.

Step 2 For each gate $g$, in the ordering defined by $\sigma$, let $g_1 < \cdots < g_d$ be the
wires on which the $d$-ary gate $Q_g$ acts. Replace $Q^{(g)}$ with $U^{(g)}$, the appropriately
padded $(n+1)$-qubit operator for the quantum gate $U_g = h(Q_g)$ acting on wires
$g_1 + 1 < \cdots < g_d + 1$ and the top qubit wire.

Step 3 Construct the overall quantum circuit by concatenating the circuits for each level
$g$, in the same order as defined in Step 1. That is, if $U$ is the operator for the quantum
circuit, then let $U = U^{(s)} \cdots U^{(2)} U^{(1)}$.

Step 4 Write a description of the quantum circuit and of its (classical) input state and ask
the quantum computing "oracle" to provide the result of a measurement on its final state.

Step 5 Perform exactly the same classical post-processing on the result as the original
quaternionic algorithm.

The construction of the circuit as described in Section 3.3 is purely formal, and does not depend
at all on the actual gates and operators. In particular, other than circuit operator algebra, the
proof of Lemma 3 only required that $h$ be a group isomorphism, a fact which we have already
established for the new $h$. Thus we can claim the following equivalent lemma.

Lemma 8. The inverse image of $U$ is precisely $Q_\sigma$, i.e. $U = h(Q_\sigma)$.

4.2.3 Initialisation and Measurement

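We can maintain the same semantics for $|\Phi_0\rangle$ and $|\Phi_1\rangle$ as defined in
Equations (18) and (19), by using the columns $\hat{T}_0$ and $\hat{T}_1$ of the new tensor
$\hat{T} = [\hat{T}_0 | \hat{T}_1]$,

$$|\Phi\rangle \;\overset{h_0}{\longmapsto}\; |\Phi_0\rangle = \hat{T}_0 \otimes |\Phi\rangle = \begin{pmatrix} \mathrm{Co} \\ -\mathrm{Wd}^{*} \end{pmatrix} \otimes |\Phi\rangle \tag{52}$$

$$|\Phi\rangle \;\overset{h_1}{\longmapsto}\; |\Phi_1\rangle = \hat{T}_1 \otimes |\Phi\rangle = \begin{pmatrix} \mathrm{Wd} \\ \mathrm{Co}^{*} \end{pmatrix} \otimes |\Phi\rangle \tag{53}$$

With these definitions, we have the same base cases for setting the top wire, thanks to the
following lemma, equivalent to Lemma 4.

Lemma 9. Let $|\Psi\rangle$ be any $n$-quaterbit state, then we have that the images of
$|\Psi_0\rangle$ and $|\Psi_1\rangle$ in the quantum circuit are

$$U|\Psi_0\rangle = \hat{T}_0 \otimes |\Phi\rangle = |\Phi_0\rangle \tag{54}$$
$$U|\Psi_1\rangle = \hat{T}_1 \otimes |\Phi\rangle = |\Phi_1\rangle \tag{55}$$

Proof. With the quaternion matrix multiplication rules obtained from Equation 41, we have

$$
\begin{aligned}
U|\Psi_0\rangle &= (\hat{T} \otimes Q)(\hat{T}_0 \otimes |\Psi\rangle) \\
&= \begin{pmatrix} \mathrm{Co}(Q) & \mathrm{Wd}(Q) \\ -\mathrm{Wd}^{*}(Q) & \mathrm{Co}^{*}(Q) \end{pmatrix}
   \begin{pmatrix} \mathrm{Co}(|\Psi\rangle) \\ -\mathrm{Wd}^{*}(|\Psi\rangle) \end{pmatrix} \\
&= \begin{pmatrix}
   \mathrm{Co}(Q)\,\mathrm{Co}(|\Psi\rangle) - \mathrm{Wd}(Q)\,\mathrm{Wd}^{*}(|\Psi\rangle) \\
   -\mathrm{Wd}^{*}(Q)\,\mathrm{Co}(|\Psi\rangle) - \mathrm{Co}^{*}(Q)\,\mathrm{Wd}^{*}(|\Psi\rangle)
   \end{pmatrix} \\
&= \begin{pmatrix} \mathrm{Co}(Q|\Psi\rangle) \\ -\mathrm{Wd}^{*}(Q|\Psi\rangle) \end{pmatrix} \\
&= \hat{T}_0 \otimes (Q|\Psi\rangle) \\
&= \hat{T}_0 \otimes |\Phi\rangle = |\Phi_0\rangle
\end{aligned}
\tag{56}
$$

And similarly for $|\Phi_1\rangle$, i.e.

$$
\begin{aligned}
U|\Psi_1\rangle
&= \begin{pmatrix} \mathrm{Co}(Q) & \mathrm{Wd}(Q) \\ -\mathrm{Wd}^{*}(Q) & \mathrm{Co}^{*}(Q) \end{pmatrix}
   \begin{pmatrix} \mathrm{Wd}(|\Psi\rangle) \\ \mathrm{Co}^{*}(|\Psi\rangle) \end{pmatrix} \\
&= \begin{pmatrix} \mathrm{Wd}(Q|\Psi\rangle) \\ \mathrm{Co}^{*}(Q|\Psi\rangle) \end{pmatrix} \\
&= \hat{T}_1 \otimes |\Phi\rangle = |\Phi_1\rangle
\end{aligned}
\tag{57}
$$

□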

Finally, we need to show that as before we can initialise with any qubit value in the top wire, 
ignore it at measurement, and still get the same statistics as we would have with the original 
quaternionic circuit. For that, we have to show that the equivalent of Lemma 5 is still true. 

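Lemma 10. Let $|\Phi\rangle$ be an arbitrary $n$-quaterbit state, $|\Phi_0\rangle$ and
$|\Phi_1\rangle$ its images under $h_0$ and $h_1$, and $\rho_0$ and $\rho_1$ be their respective
partial traces when the first qubit wire is traced out. Then,

$$\mathrm{Diag}(\rho_0) = \mathrm{Diag}(\rho_1) = \mathrm{Diag}\big(|\Phi\rangle\langle\Phi|\big). \tag{58}$$

Proof. The expressions for the non-reduced density operators are given by

$$
\begin{aligned}
|\Phi_0\rangle\langle\Phi_0| &= (\hat{T}_0 \otimes |\Phi\rangle)\,(\hat{T}_0 \otimes |\Phi\rangle)^{\dagger} \\
&= \begin{pmatrix} \mathrm{Co}(|\Phi\rangle) \\ -\mathrm{Wd}^{*}(|\Phi\rangle) \end{pmatrix}
   \begin{pmatrix} \mathrm{Co}(\langle\Phi|) & \mathrm{Wd}(\langle\Phi|) \end{pmatrix} \\
&= \begin{pmatrix}
   \mathrm{Co}(|\Phi\rangle)\,\mathrm{Co}(\langle\Phi|) & \mathrm{Co}(|\Phi\rangle)\,\mathrm{Wd}(\langle\Phi|) \\
   -\mathrm{Wd}^{*}(|\Phi\rangle)\,\mathrm{Co}(\langle\Phi|) & -\mathrm{Wd}^{*}(|\Phi\rangle)\,\mathrm{Wd}(\langle\Phi|)
   \end{pmatrix}
\end{aligned}
\tag{59}
$$

and similarly,

$$
\begin{aligned}
|\Phi_1\rangle\langle\Phi_1| &= (\hat{T}_1 \otimes |\Phi\rangle)\,(\hat{T}_1 \otimes |\Phi\rangle)^{\dagger} \\
&= \begin{pmatrix} \mathrm{Wd}(|\Phi\rangle) \\ \mathrm{Co}^{*}(|\Phi\rangle) \end{pmatrix}
   \begin{pmatrix} -\mathrm{Wd}^{*}(\langle\Phi|) & \mathrm{Co}^{*}(\langle\Phi|) \end{pmatrix} \\
&= \begin{pmatrix}
   -\mathrm{Wd}(|\Phi\rangle)\,\mathrm{Wd}^{*}(\langle\Phi|) & \mathrm{Wd}(|\Phi\rangle)\,\mathrm{Co}^{*}(\langle\Phi|) \\
   -\mathrm{Co}^{*}(|\Phi\rangle)\,\mathrm{Wd}^{*}(\langle\Phi|) & \mathrm{Co}^{*}(|\Phi\rangle)\,\mathrm{Co}^{*}(\langle\Phi|)
   \end{pmatrix}
\end{aligned}
\tag{60}
$$

As before, the reduced density operators are the sum of the block matrices on the diagonal,
which, unlike in Lemma 5, are not the same in both cases. However, the $i$-th entry on the
diagonal is given by

$$
\begin{aligned}
\langle i|\rho_0|i\rangle
&= \langle i|\big[\mathrm{Co}(|\Phi\rangle)\,\mathrm{Co}(\langle\Phi|) - \mathrm{Wd}^{*}(|\Phi\rangle)\,\mathrm{Wd}(\langle\Phi|)\big]|i\rangle \\
&= \langle i|\mathrm{Co}(|\Phi\rangle)\,\mathrm{Co}(\langle\Phi|)|i\rangle - \langle i|\mathrm{Wd}^{*}(|\Phi\rangle)\,\mathrm{Wd}(\langle\Phi|)|i\rangle \\
&= \mathrm{Co}(\Phi_i)\,\mathrm{Co}(\Phi_i^{\star}) - \mathrm{Wd}^{*}(\Phi_i)\,\mathrm{Wd}(\Phi_i^{\star}) \\
&= \mathrm{Co}(\Phi_i)\,\mathrm{Co}^{*}(\Phi_i) + \mathrm{Wd}^{*}(\Phi_i)\,\mathrm{Wd}(\Phi_i) \\
&= |\mathrm{Co}(\Phi_i)|^2 + |\mathrm{Wd}(\Phi_i)|^2 = |\Phi_i|^2
\end{aligned}
\tag{61}
$$

where $\Phi_i$ is the $i$-th coordinate of $|\Phi\rangle$, and we use the properties of Co and Wd in
Equation 43. We also have,

$$
\begin{aligned}
\langle i|\rho_1|i\rangle
&= \langle i|\big[-\mathrm{Wd}(|\Phi\rangle)\,\mathrm{Wd}^{*}(\langle\Phi|) + \mathrm{Co}^{*}(|\Phi\rangle)\,\mathrm{Co}^{*}(\langle\Phi|)\big]|i\rangle \\
&= -\langle i|\mathrm{Wd}(|\Phi\rangle)\,\mathrm{Wd}^{*}(\langle\Phi|)|i\rangle + \langle i|\mathrm{Co}^{*}(|\Phi\rangle)\,\mathrm{Co}^{*}(\langle\Phi|)|i\rangle \\
&= -\mathrm{Wd}(\Phi_i)\,\mathrm{Wd}^{*}(\Phi_i^{\star}) + \mathrm{Co}^{*}(\Phi_i)\,\mathrm{Co}^{*}(\Phi_i^{\star}) \\
&= \mathrm{Wd}(\Phi_i)\,\mathrm{Wd}^{*}(\Phi_i) + \mathrm{Co}^{*}(\Phi_i)\,\mathrm{Co}(\Phi_i) \\
&= |\mathrm{Wd}(\Phi_i)|^2 + |\mathrm{Co}(\Phi_i)|^2 = |\Phi_i|^2
\end{aligned}
\tag{62}
$$

□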
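
Lemmas 9 and 10 can be illustrated on a single quaterbit. The sketch below assumes NumPy and represents quaternionic matrices and states as (Co, Wd) pairs of complex arrays; the helpers `h`, `encode`, `qapply` and `bottom_statistics`, as well as the example gate $Q = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 & j \\ j & 1 \end{pmatrix}$ and input $|0\rangle$, are illustrative assumptions.

```python
import numpy as np

def h(Q):
    """Complex image [[Co, Wd], [-Wd*, Co*]] of a quaternionic matrix (Co, Wd)."""
    co, wd = Q
    return np.block([[co, wd], [-wd.conj(), co.conj()]])

def encode(state, top=0):
    """Qubit encodings (Co, -Wd*) for top=0 and (Wd, Co*) for top=1."""
    co, wd = state
    if top == 0:
        return np.concatenate([co, -wd.conj()])
    return np.concatenate([wd, co.conj()])

def qapply(Q, state):
    """Apply a quaternionic matrix to a quaternionic state (Equation 41)."""
    (cq, wq), (cs, ws) = Q, state
    return cq @ cs - wq @ ws.conj(), cq @ ws + wq @ cs.conj()

def bottom_statistics(vec):
    """Outcome probabilities of the bottom wires when the top qubit is ignored."""
    half = vec.size // 2
    return np.abs(vec[:half]) ** 2 + np.abs(vec[half:]) ** 2

# Quaternionic unitary (1/sqrt 2) [[1, j], [j, 1]] acting on the state |0>.
Q = (np.eye(2, dtype=complex) / np.sqrt(2),
     np.array([[0, 1], [1, 0]], dtype=complex) / np.sqrt(2))
psi = (np.array([1, 0], dtype=complex), np.zeros(2, dtype=complex))

phi = qapply(Q, psi)                                 # quaternionic final state
target = np.abs(phi[0]) ** 2 + np.abs(phi[1]) ** 2   # |Phi_i|^2 per outcome

out0 = h(Q) @ encode(psi, 0)   # top qubit initialised to |0>
out1 = h(Q) @ encode(psi, 1)   # top qubit initialised to |1>
```

Here the quaternionic final state is $(|0\rangle + j|1\rangle)/\sqrt{2}$, and both initialisations of the top qubit give the uniform distribution on the bottom wire, as `target` does.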

5 Considerations and Consequences 
5.1 Complexity of Simulation 

In terms of simulation resources, the situation is similar to that of real computing. Circuit 
width is increased by only one, but circuit depth can be equal to the circuit size in the worst 
case. 

For circuit size, however, we have to make a slight distinction. While the number of
$(d+1)$-ary gates in the new circuit will be the same as the number of $d$-ary gates in the
original circuit, one might not be satisfied with this type of gate count complexity for the
quantum circuit, given that we do not know $d$ and that we have very small universal gates for
quantum circuits. In general, if we suppose that the original circuit given to us is constructed
with some set of universal gates, then the simulation will depend on $d$, the number of
quaterbits in the largest gate in the universal set. In particular, if $d > 3$ we might need to
decompose such a gate $Q_g$ into a set of elementary 3-, 2- or 1-qubit gates, universal for
quantum computing.

We can assume without loss of generality that we are given a full description of $Q_g$ in terms
of its $2^d \times 2^d$ quaternion matrix. We can then use the generic method for decomposing the
matrix of the image quantum operator $U_g = h(Q_g)$ into our set of elementary gates. Since
$U_g$ is a $2^{d+1} \times 2^{d+1}$ matrix, this might require $O(2^d)$ time, and furthermore up to
$2^{d+1}$ elementary gates might be required to decompose $Q_g$.

If a "nice" universal set is being used where $d$ is a small constant, then this decomposition
will occur in $O(1)$ time and will produce $O(1)$ extra gates. Hence, we have that the total gate
count is not exactly $n$, but is still in $O(n)$. The circuit depth, which could already be as
large as $s$, could be increased further by gate decomposition, but again, only by a constant
factor.

While we have not gone through the exercise of looking for a finite universal set of elementary
gates which would be computationally universal for the symplectic group, we believe that one
exists. Even without the luxury of a finite universal set, it would be in principle sensible to
define a computational model using quaternionic gates, as long as the description of all circuits
(and their gates) is of limited size and can be uniformly generated. In fact, our results do
not need the existence of a universal set; it would just make the computing model more
"realistic."

On the other hand, let us also consider a variety of quaternionic circuits which includes gates
of arbitrary degree; since we cannot show a "nice" universal set with constant degree gates, let
us do so for the sake of completeness. In that case, if the circuit description has size
polynomial in $n$, then the description of $Q_g$ must also be of polynomial size, and this puts
an upper bound on $d$, i.e. $d = O(\log n)$. Thus, in the worst case, we can have that each $Q_g$
will require $2^{d+1} = O(n)$ elementary quantum gates, all in series, with a resulting $O(n)$
depth and size overhead for each gate. Computing these decompositions would take time at most
$O(n)$ per gate. We summarise these results in Table 1.





|       | Quaternionic circuit | Quantum circuit |
|-------|----------------------|-----------------|
| width | $n$                  | $n + 1$         |
| size  | $s$                  |                 |
| depth | $t$                  |                 |

Table 1: The overall resources needed to simulate a quaternionic circuit built with $d$-ary gates,
with a quantum circuit built with 2-ary gates.

We stress the fact that this is a worst case scenario due to the fact that we cannot bound $d$ by
a constant, as we have not yet shown any universal set of quaternionic gates. If we did, then
$d = O(1)$, and the results would be the same, up to a constant, as those for Theorem 2.

5.2 Interpretations of the Quaternionic Model 

Because of the similarity of the constructions of Theorems 2 and 4, we can give similar
interpretations to the role of the extra required top qubit. More concretely, if we label the
basis of the $2N$-dimensional complex Hilbert space as $|b_c\rangle = h_0(|b\rangle)$ and
$|b_w\rangle = h_1(|b\rangle)$, and order them accordingly, we can give the same semantics to the
extra wire required by the simulation. That is, the extra qubit is at the top of the simulating
circuit, and in a similar way as before keeps track of the "phase" information between both
orthogonal subspaces of the complex Hilbert space spanned by the $|b_c\rangle$ and $|b_w\rangle$
base vectors. In this case, however, this information requires the full "power" of a qubit, and
not just a rebit. This is due to the fact that the phase information is defined by a unit
quaternion, which cannot be represented by just one angle (as is the case for a unit complex
number).

We can infer that with this same method it is not possible to simulate an $n$-quaterbit circuit
with only $n + 1$ rebits. The following corollary, however, shows that just one extra rebit is
sufficient.

Corollary 1. Any temporal chain $\sigma$ of an $n$-quaterbit quaternionic circuit can be exactly
simulated by an $(n + 2)$-rebit real circuit.

Two proofs are possible. First, we can simply combine the results of Theorems 2 and 4.
More interestingly, however, a direct proof is possible by using the standard representation of
quaternions as $4 \times 4$ real matrices, which suggests the following tensor $S$

$$
S = \begin{pmatrix}
\mathrm{Re} & \mathrm{Im} & -\mathrm{Km} & -\mathrm{Jm} \\
-\mathrm{Im} & \mathrm{Re} & -\mathrm{Jm} & \mathrm{Km} \\
\mathrm{Km} & \mathrm{Jm} & \mathrm{Re} & \mathrm{Im} \\
\mathrm{Jm} & -\mathrm{Km} & -\mathrm{Im} & \mathrm{Re}
\end{pmatrix}
\tag{63}
$$

where $\mathrm{Jm}(\hat{\alpha}) = a_2$ and $\mathrm{Km}(\hat{\alpha}) = a_3$ are the "other"
imaginary parts of quaternion $\hat{\alpha}$. This tensor induces a group isomorphism from
Sp($N$) to SO($4N$), which has all the properties required for the simulation to be sound.
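
The multiplicativity of such a $4 \times 4$ real representation is easy to test numerically. The sketch below assumes NumPy and one particular sign pattern for the tensor (rows (Re, Im, -Km, -Jm), (-Im, Re, -Jm, Km), (Km, Jm, Re, Im), (Jm, -Km, -Im, Re)); this pattern is an assumption chosen because it is multiplicative, and other equivalent conventions exist. The function names `real_rep` and `hamilton` are illustrative.

```python
import numpy as np

def real_rep(a0, a1, a2, a3):
    """4 x 4 real matrix of a0 + a1 i + a2 j + a3 k (assumed sign convention)."""
    return np.array([[ a0,  a1, -a3, -a2],
                     [-a1,  a0, -a2,  a3],
                     [ a3,  a2,  a0,  a1],
                     [ a2, -a3, -a1,  a0]], dtype=float)

def hamilton(a, b):
    """Hamilton product of two quaternions given as 4-tuples."""
    a0, a1, a2, a3 = a
    b0, b1, b2, b3 = b
    return (a0 * b0 - a1 * b1 - a2 * b2 - a3 * b3,
            a0 * b1 + a1 * b0 + a2 * b3 - a3 * b2,
            a0 * b2 - a1 * b3 + a2 * b0 + a3 * b1,
            a0 * b3 + a1 * b2 - a2 * b1 + a3 * b0)

a = (1, 2, 3, 4)
b = (5, 6, 7, 8)
product_of_reps = real_rep(*a) @ real_rep(*b)
rep_of_product = real_rep(*hamilton(a, b))
```

Under this convention the matrix of a product equals the product of the matrices, and the matrix of a unit quaternion is orthogonal, which is what the real simulation requires.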

Within the context of this simulation, the fact that the output of a quaternionic circuit
depends on the order of evaluation of the gates becomes painfully obvious. Consider a simple
quaternionic circuit with two 1-quaterbit gates $A$ and $B$ acting in parallel on two separate
quaterbits. Let us consider a complex qubit simulation, as illustrated in Figure 7, where $A'$
and $B'$ are the in-context unitary complex operators simulating them, respectively. Since we do
not expect $A'$ and $B'$ to commute in general, the simulation will produce two different results,
depending on which of the two gates is placed first.

Figure 7: More effects of quaternionic non-commutativity on quaternionic circuits. In this
simple 2-quaterbit example, the global circuit operator will depend on whether gate $A$ is
executed (a) "before" or (b) "after" gate $B$.

However, since we have shown that the simulation is always accurate, this ambiguity exists 
even if no simulation ever happens. In other words, this non-local time dependence is a natural 
property of quaternionic systems. 
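
The order dependence can be reproduced directly at the level of quaternionic amplitudes, without any simulation. The sketch below is a toy illustration under stated assumptions: gate entries act on amplitudes by left multiplication, and the two diagonal gates `A` and `B` (with entries i, 1 and j, 1) are hypothetical examples chosen by us, not gates taken from the text. Starting from the same product state, applying A before B or B before A on separate quaterbits yields different global states.

```python
# Quaternions are 4-tuples (a0, a1, a2, a3) standing for a0 + a1*i + a2*j + a3*k.

def qmul(p, q):
    """Hamilton product p*q, using the convention i*j = k."""
    p0, p1, p2, p3 = p
    q0, q1, q2, q3 = q
    return (
        p0*q0 - p1*q1 - p2*q2 - p3*q3,
        p0*q1 + p1*q0 + p2*q3 - p3*q2,
        p0*q2 - p1*q3 + p2*q0 + p3*q1,
        p0*q3 + p1*q2 - p2*q1 + p3*q0,
    )

def qadd(p, q):
    return tuple(x + y for x, y in zip(p, q))

ZERO, ONE = (0, 0, 0, 0), (1, 0, 0, 0)
I_UNIT, J_UNIT = (0, 1, 0, 0), (0, 0, 1, 0)

# Hypothetical 1-quaterbit gates: diagonal with unit-quaternion entries.
A = [[I_UNIT, ZERO], [ZERO, ONE]]
B = [[J_UNIT, ZERO], [ZERO, ONE]]

def apply_first(gate, state):
    """Apply a 1-quaterbit gate to the first quaterbit (left multiplication assumed)."""
    new = {}
    for x in (0, 1):
        for y in (0, 1):
            acc = ZERO
            for xp in (0, 1):
                acc = qadd(acc, qmul(gate[x][xp], state[(xp, y)]))
            new[(x, y)] = acc
    return new

def apply_second(gate, state):
    """Apply a 1-quaterbit gate to the second quaterbit (left multiplication assumed)."""
    new = {}
    for x in (0, 1):
        for y in (0, 1):
            acc = ZERO
            for yp in (0, 1):
                acc = qadd(acc, qmul(gate[y][yp], state[(x, yp)]))
            new[(x, y)] = acc
    return new

# Product state with all four amplitudes equal to 1/2.
half = (0.5, 0, 0, 0)
product_state = {(x, y): half for x in (0, 1) for y in (0, 1)}

a_then_b = apply_second(B, apply_first(A, product_state))
b_then_a = apply_first(A, apply_second(B, product_state))

for basis in sorted(product_state):
    print(basis, a_then_b[basis], b_then_a[basis])
```

In this toy case the two orderings agree on three amplitudes and differ on the |00⟩ amplitude, which picks up the factor j·i = −k in one ordering and i·j = k in the other.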

From an Information Theory point of view, this non-local dependence resembles entanglement,
except that it somehow comes for "free"*: even if the initial state is completely uncorrelated
and unentangled (i.e. a product state), we can obtain a global state in which this dependence
exists even without performing any non-local operations. The same is not possible in
standard, complex-based (qubit) Quantum Information Theory. Even though we dare not call
this curiosity "entanglement", the fact is that it has important consequences for a quaternionic
Information Theory. In particular, it would allow Alice and Bob to perform a reasonable form
of unconditionally secure bit commitment. In our example, let Alice and Bob hold two
completely unentangled and uncorrelated quaterbits. At commitment time, Bob announces that
he is about to do operation B on his quaterbit. Alice then commits to 0 or 1 by performing A
on her quaterbit before or after Bob does B, respectively. The commitment is opened by Alice
and Bob putting their quaterbits together, measuring them, and verifying which of the two
possible outcomes came out. Of course, Bob cannot open before Alice provides her quaterbit,
and Alice cannot change her mind after Bob has done B without it being detected by Bob at
the opening stage. Again, this is not possible in standard Quantum Information Theory
[16, 15].

* Not to be confused with free entanglement, a term which is sometimes used to refer to the
opposite of bound entanglement.

According to current attempts to reformulate Quantum Mechanics purely in terms of
Information Theory ([11, 12], among others), the possibility of bit commitment alone would be
sufficient to rule out quaternionic models as reflecting physical reality. However, even if this
programme does not succeed, one might argue that the non-local time-dependence exhibited
by quaternionic models is in itself non-physical enough to rule them out, as it can be seen
as a mild violation of the usual causality properties of Nature.

6 Final Conclusions and Further Questions 

The possibility of a quaternionic speed-up? 

We have shown how a somewhat sensible model of computing can be constructed using
quaternionic amplitudes. A crucial characteristic of this model is that, due to the non-commutativity
of quaternions, the output of the circuit will depend on the "evaluation path" of the circuit, as
there is no unique circuit operator for all possible ways of re-combining the gates. However, any
such ordering of gates generates a well-defined output, which we have shown can be simulated
exactly and efficiently by a quantum circuit of similar size and width. This was our main result,
which was inspired by a new proof we constructed for the equivalence of complex and real
circuits. Despite this somewhat strange and unexpected parametrised definition of quaternionic
computing, what this result in essence tells us is that all of these paths along the space-time
continuum somehow have the same computational power, and furthermore that they can be
independently simulated in an efficient manner by a standard quantum computer.

We can interpret Theorem 4 as a general result on quaternionic physical models as follows. If
somehow Nature chooses and prefers one of the possible paths of evolution through the state
space, then Nature's behaviour on such quaternionic systems can be efficiently simulated by a
quantum system of similar complexity, provided that we somehow know which path is
preferred. If this were indeed the case (for example, because the physicists told us so),
we complexity theorists might rub our hands together in satisfaction and further sing the
praises of the robustness of the BQP complexity class.

But what if Nature somehow neither preferred nor chose one of these paths, but instead followed
them all at the same time? Would there be any mechanism by which the results of the different
computations would be weighted (by probabilities or probability amplitudes)? Could these paths
interfere with each other, in a similar fashion as in quantum models? Destructively? And if that
were the case, could we ultimately harness such extra parallelism to achieve a speed-up beyond
those achievable with quantum computation models?

Ruling out quaternions and real numbers 

The skeptics and realists among the readership might argue that all of these questions are
completely sterile and void of interest. Despite the fact that a quaternionic version of Quantum
Mechanics has been proposed [2], we do not really know where or how Nature would exhibit such
"quaternionic behaviour", and even less how to harness it. If our only objective were to one day
build a quaternionic computer, we the authors would be the first to agree with such skeptics.
Nevertheless, we believe that one of the major contributions of this work has been to
identify a simple and easily explainable potential reason why there should not be quaternionic
amplitudes involved in Nature: the asymmetry of the possible evolution paths between two
space-time events, even without relativistic effects. This, the physicists might argue, is a
violation of some fundamental principle, and hence not possible or likely. From an
information-theoretic point of view, the possibility of bit commitment provides reason enough to rule
such models out as "natural". These realisations are also in line with the work of Lucien Hardy [13],
which also exposes the unnaturalness of quaternionic or real models by considering the number of
degrees of freedom involved in the composition of such sub-systems.

From a purely information-theoretic point of view, there was already some work, including
that of Caves, Fuchs and Schack [9] and Vlasov [20], identifying some non-trivial differences
between standard and real-number or quaternion-based Information Theory. In the context of
this paper, it is interesting to note that the converses of Theorems 2 and 4 are not necessarily
true. Not all (n + 1)-rebit/qubit circuits can be simulated by n-qubit/quaterbit circuits, which
stems from the fact that the images of the respective maps h do not span the whole of SO(2N)
and Sp(2N), respectively, as a simple counting argument shows. While from a complexity point
of view requiring one extra qubit/rebit is not a big deal, this asymmetry between the models
might make a difference in other quantum information processing tasks. For example, one might
ask how many classical bits are required to teleport a quaterbit, whether using quaterbits
affects the various quantum channel capacity measures, how communication complexity is
affected, etc. Furthermore, while we have concentrated our discussion here on departures of the
quaternionic models from the standard one, the same fundamental questions as above can be
asked of rebits. We believe that a study and discussion of the real-number case would also be
interesting and shed even more light on the physicality (or lack thereof) of such models,
quaternionic or real.
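
A counting argument of this kind can be made concrete by comparing the real dimensions of the Lie groups involved. The sketch below uses the standard formulas dim U(N) = N², dim SO(M) = M(M − 1)/2 and dim Sp(N) = N(2N + 1); the function names are illustrative, and taking N = 2^n for n (qu/quater)bits is our assumption about the notation. It compares U(N) with SO(2N) (the qubit-to-rebit simulation) and Sp(N) with SO(4N) (the direct quaterbit-to-rebit simulation of Corollary 1).

```python
def dim_U(N):
    """Real dimension of the unitary group U(N)."""
    return N * N

def dim_SO(M):
    """Real dimension of the special orthogonal group SO(M)."""
    return M * (M - 1) // 2

def dim_Sp(N):
    """Real dimension of the quaternionic unitary (compact symplectic) group Sp(N)."""
    return N * (2 * N + 1)

def dimension_table(max_n):
    """Rows (n, dim U(N), dim SO(2N), dim Sp(N), dim SO(4N)) with N = 2**n (assumed)."""
    rows = []
    for n in range(1, max_n + 1):
        N = 2 ** n
        rows.append((n, dim_U(N), dim_SO(2 * N), dim_Sp(N), dim_SO(4 * N)))
    return rows

for n, u, so2, sp, so4 in dimension_table(4):
    print(f"n={n}: dim U(N)={u} < dim SO(2N)={so2}; dim Sp(N)={sp} < dim SO(4N)={so4}")
```

For every n ≥ 1 the source group has strictly smaller dimension than the target group, so its image cannot fill the target, which is the sense in which the converse simulations fail.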

Completing the Algebraic "Big Picture" 

Finally, we believe that the continued study of non-standard algebraic models such as these,
based for example on the octonions or even possibly finite fields, will also bear fruit in that
direction. More concretely, we hope that, at the very least, we might be able to provide more
examples of "weird properties" which might discount these models as "unnatural", doing so
more easily in the language of Information Theory than in that of Physics.

From a Complexity Theory point of view, this would also be of value in further completing
the algebraic "big picture" of complexity classes painted in [5, 6]. This picture, so far, gives
evidence of how little the actual amplitude structure does to change computational power,
and further points to what we believe is the ultimate cause of the "quantum speedup": the
possibility for probability amplitudes to interfere destructively.